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THE NATURE OF THE EFFECTIVE STIMULUS IN 
ANIMAL DISCRIMINATION LEARNING: 
TRANSPOSITION RECONSIDERED 
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When an animal learns to discrimi- 
nate between two stimuli lying on the 
same physical dimension, what is the 
nature of the effective stimulus with 
which the response is associated? 

In 1937, Spence (13) presented a 
theory which assumed that the animal 
learns to approach the absolute proper- 
ties of the positive stimulus and that in- 
dependent of this, it learns not to ap- 
proach the absolute properties of the 
negative stimulus. In those situations 
in which the animal learns to discrimi- 
nate between two points on the same 
Physical dimension, Spence defined the 
effective stimulus in conventional physi- 
cal units—for example, the number of 
square centimeters in a size discrimina- 
tion, or the number of foot candles in 
a brightness discrimination. Spence con- 
trasted this absolute definition of the 
effective stimulus with Köhler’s earlier 
assumption (7) that the animal re- 
sponds to the relationship between the 
two stimuli, and learns to go to the 
larger or the brighter of the two stimuli. 
Spence made some additional assump- 
tions that enabled him to predict the 
transposition of the learned response to 
pairs of test stimuli which differ from 
the training stimuli. In contrast to the 


relational theory, which made no quan- 


titative predictions, Spence’s theory pre- 
dicted that transposition would decrease 
as the test stimuli became less and less 
similar to the training stimuli. This 
prediction has been confirmed in several 
experiments (1, 6, 9, 13). 

This paper will present evidence that 
Spence’s assumption that the animal re- 
sponds to the absolute properties of 
the stimulus, defined in units such as 
absolute size or luminance, is unten- 
able. This evidence also casts doubt 
on Spence’s explanation of the decline 
in transposition. Two alternative rein- 
terpretations of the facts of transposi- 
tion will then be considered. Before 
presenting the new evidence, Spence’s 
theory will be described, together with 
an example of the type of evidence sup- 
porting it. 

Near and far transposition tests. The 
meaning of the terms “near transposi- 
tion test” and “far transposition test” 
can be given schematically. Let letters 
A through J represent points on a physi- 
cal continuum, such that the ratio of B 
to A is the same as C to B, and so on, 
Tf the S is originally trained to choose 
B in the presence of A and B, then a 
test with B and C is a near transposi- 
tion test. A test with any other pair of 
adjacent, stjmuli (eg., H and I) is a 
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far transposition test. For example, if 
in original training the ratio of the 
magnitude of the positive to the nega- 
tive stimulus is 2/1, then in the near 
transposition test, the ratio will be 4/2. 
In the far transposition test, the ratio 
might be 64/32, or 128/64. Although 
Spence and others have discussed other 
combinations of stimuli, in the interest 
of simplicity and brevity, the discussion 
here will be restricted to the case in 
which the test stimuli are of greater 
magnitude than the training stimuli. 
The sample experiment which will 
now be described in this simplified form 
is one by Kendler (6). Two stimuli 
serving as cues to the presence or ab- 


sence of reward were presented to the- 
Ss (in this case, rats). These stimuli 


differed only in that one had a photo- 
metric value of .011 foot candle while 
the other had a value of .63 foot candle. 
Going to one of the stimuli (e.g., the 
brighter) was always followed by re- 
ward. Going to the other was followed 
by nonreward. After learning, the ani- 
mal was tested with other Pairs of 
stimuli. Different animals were tested 
at different intensities, but in each test 
the ratio between the two stimuli was 
the same as in original training. In 
each test, the animals preferred the 
more intense stimulus, but the magni- 
tude of the preference decreased as the 
intensity of the test pairs increased,? 
That is, far transposition is weaker than 
hear transposition, Similar results have 
been found with size as the critical di- 
mension (1, 9, 13). 

The absolute theory. Because Spence’s 
theory is given in detail in many refer- 
ences, it will be summarized here only 


1The amount of transposition in Kendler’s 
experiment did not decline monotonically 
when the test stimuli were more intense than 
the training stimuli. The animals transposed 
more to the most remote stimuli than to those 
of intermediate value. The reason for this is 
not clear, but one possibility is that the volt- 
age changes that were introduced changed the 
wave length of the light. This in turn might 
influence brightness discrimination. 


in very brief form. It assumes that if 
the animal is regularly rewarded when 
it approaches one point on a stimulus 
dimension, and is regularly nonrewarded 
when it approaches another point on 
the same dimension, the animal will de- 
velop corresponding approach (Excita- 
tory) and nonapproach (Inhibitory) 
tendencies. These E and I tendencies 
will be elicited by other points on the 
dimension, but the intensity of the 
elicited tendency will decline as the dis- 
tance of the test point from the training 
point increases. The shapes of the E 
and I gradients are positively deceler- 
ated, and, in the simplest case, have the 
same equation. The tendency to make 
an approach response to any point on 
the dimension will be a simple function 
of the difference between the two oppos- 
ing tendencies. Since, at points above 
the training stimuli on the stimulus con- 
tinuum, the I generalization gradient 
will fall more rapidly than the E gradi- 
ent, transposition will occur, As both 
curves fall, the greater E-minus-I value 
of the more intense stimulus becomes 
relatively smaller, and less transposition 
occurs. 

Limitations in the generality of the 
absolute theory. While it is true that 
this absolute theory does account for 
transposition, and for the drop in trans- 
position in the far test, some recent re- 
search, especially that of Lawrence and 
his associates, appears to present find- 
ings incompatible with Spence’s account 
of transposition. Baker and Lawrence 
(2) found transposition with rats fol- 
lowing conventional training on a size- 
discrimination problem, but did not find 
transposition when the training stimuli 
were presented successively. Unfortu- 
nately, the two conditions differed in 
one respect other than simultaneity 
vs. successiveness. The animals in the 
simultaneous condition learned to cri- 
terion, and thus received almost 100% 
reinforcement toward the end of train- 
ing. In contrast, the animals in the 
successive condition had the same num- 
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ber of trials with the negative stimulus 
as with the positive stimulus through- 
out training, and consequently were re- 
inforced only 50% of the time. Baker 
and Lawrence pointed out that in spite 
of this methodological flaw, the results 
are not consistent with Spence’s theory. 

Another apparent contradiction of the 
absolute theory is an experiment by 
Lawrence and DeRivera (10). Animals 
were trained in a discrimination prob- 
lem in which, on each trial, two doors 
showed the same stimulus pattern. The 
bottom half of each door was always 
the same medium gray (shade No. 4 
in a set of seven shades). The top half 
of each was either a shade of gray three 
steps lighter (No. 1) than the bottom 
half, or a shade of gray three steps 
darker (No. 7) than the bottom. The 
animals were trained to jump to one 
door when the upper parts were lighter 
than the lower, and to jump to the 
other door when the upper parts were 
darker. After training, the animals were 
given combinations that tested between 
a relational interpretation of learning 
and an absolute interpretation. For ex- 
ample, with No. 3 over No. 1, the ani- 
mal should jump to the side associated 
with the lighter gray, according to the 
absolute hypothesis, because both cards 
are lighter than No. 4; but since the 
relationship is dark over light, they 
should jump the other way according 
to the relational hypothesis. The re- 
sults predominantly supported the rela- 
tional hypothesis. 

Clearly, the results of Lawrence eż al. 
appear to be in contradiction to Spence’s 
theory that the animal responds to the 
Cues in an independent fashion. As 
a possible solution of these apparent 
Contradictions, Lawrence and DeRivera 
(10) and Hilgard (5) have suggested 
that under some circumstances the ani- 
mal responds relationally, and that un- 
der other circumstances it responds ab- 
solutely. That is, perhaps it responds 
absolutely in a simple simultaneous dis- 
crimination situation, but not in others. 


It is possible, however, that the decline 
in transposition in the far test has not 
yet been adequately explained, and that 
an understanding of this phenomenon 
will resolve the apparent inconsistencies. 
Then, we will be in a better position 
to state testable alternatives concerning 
the nature of transposition. 

A contrast interpretation of the drop 
in transposition. If, instead of defining 
each stimulus merely in terms of the 
patch alone, it is defined as a specific 
relationship between the value of the 
patch and the value of the surround, 
then the decline in transposition might 
result merely from the change in the 
relationship being responded to. Such 
a proposal is, of course, not new. In 
human perception, the effect of contrast 
on apparent brightness has been known 
for many years. Recently, Wallach ( 15) 
has shown that two test stimuli of 
radically different illumination will be 
judged approximately equal in bright- 
ness if the ratio of the test-stimulus in- 
tensity to surrounding intensity is the 
same for both stimuli. Similarly, Gibson 
(3) has shown that perceptual size con- 
stancy in humans depends on such fac- 
tors as the texture gradient of the stimu- 
lus field surrounding the test stimulus. 
It is not unreasonable to assume that 
apes and rats might behave in a similar 
way. 

An experimental test of the contrast 
hypothesis. The specific hypothesis to 
be tested is that after a brightness dis- 
crimination has been learned, there will 
be a drop in transposition on the far 
test only if the ratios of the test stimuli 
to their surround are changed. 

The experiment was similar to Ken- 
dler’s in that the rats were trained to 
discriminate brightness. All animals 
were trained in a modified Lashley 
jumping apparatus to jump to the more 
intense of two stimuli. The doors and 
the surrounding stimulation were illu- 
minated from projectors hung near the 
top of the room. The different illumi- 
nations on the doors and the surround 
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were determined solely by slides in the 
projectors, and by the size of openings 
in iris diaphragms on the projectors. 
During training, the brightnesses of the 
two doors and the surround, as meas- 
ured by a Zeiss photometer, were in the 
ratio of 10/3/1, the surround having 
the value of 1. The surround had a re- 
flected illumination of .05 foot candle.? 
During training, the rats were run 10 
trials a day by the noncorrection method. 
Forced trials were used to break position 
habits. Following training to a criterion 
of 19/20 correct responses, all animals 
were tested under four different condi- 
tions. Each animal was tested on only 
one condition in a day, and on days be- 
tween tests the animal was retrained to 
a criterion of 9/10 correct responses. 
Each test consisted of ten trials during 
which the animal was rewarded regard- 
less of the correctness of its choice. 
The order of conditions was counterbal- 
anced. The four conditions were: (a) 
near absolute—the two test patches 
were transposed one step up, and the 
surrounding stimulation was the same 
as in original training; (6) far absolute 
—the two test patches were made 5 
steps brighter and the surrounding 
stimulation was the same as in original 
training; (c) near ratio—the same as 
(a) except that the illumination of the 
surround was increased so that the ratio 
of intensities of the test patches to the 
surround was the same as in original 
training; and (d) far ratio—the same 
as (b) except that the illumination of 
the surround was increased so that the 
ratio of the intensities of the test 
patches to the surround was the same 
as in original training. Sixteen hooded 
rats of the Long-Evans strain were used. 

Predictions. For this experiment to 
offer a test between the absolute and 
the contrast hypotheses, it is essential 


2I wish to thank J. Gulberg, Department of 
Zoology, University of California, for super- 
vising the measurement of the physical stimuli, 
and Arnold Mechanic, who participated in the 
running of the experiment. 


TABLE 1 


MEAN NuMBER OF RESPONSES TO THE BRIGHTER 
StmuLus IN TEN CHOICES 


Near Far Near Far 
Absolute | Absolute | Ratio Ratio 

Mean 9.25 6.94 9.19 9.69 
SD 1.09 1.84 1.36 4 


that there be significantly less transpo- 
sition on the far absolute condition than 
on the near absolute condition. If this 
requirement is met, then the absolute 
theory predicts a similar drop from the 
near ratio to the far ratio condition, for 
the only two parts of the stimulus situa- 
tion that have been differentially rein- 
forced are being changed, just as they 
are in the absolute conditions. On the 
other hand, the contrast hypothesis pre- 
dicts a drop in transposition from the 
near to the far absolute condition but 
no such drop in the ratio condition. If 
the ratio present in training is present 
in the test, then the animal should 
transpose, regardless of the absolute 
values of the stimuli, 

Results. The main results are shown 
in Table 1. Their Meaning is unam- 
biguous. The animals transpose almost 
perfectly in all except the far absolute 
condition. There were no effects of or- 
der. The only effect is a highly signifi- 
cant drop in transposition under the far 
absolute condition. A ¢ test was run in 
which for each S the difference between 
near and far ratio was subtracted from 
the difference between near and far ab- 
solute. The ¢ of 4.87 is significant at 
the .0002 level of confidence, for 15 de- 
grees of freedom. 

Interpretation of the results. It is 
clear, at least in the case of brightness 
discrimination, that the animal responds 
to the relationship between the test 
stimulus and the background. It is a 
change in this relationship rather than 
a change in the absolute value of the 
stimulus that causes the breakdown in 
transposition. While this experiment 
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does not provide an explanation of 
transposition per se, it does show that 
the drop in transposition in the far test 
is not critical evidence for Spence’s ex- 
planation of transposition. The most 
immediate effect of this finding is to 
force a reconsideration of possible inter- 
pretations of transposition. Two such 
possible interpretations that seem to 
this writer to be reasonable alternatives 
will be considered. Again, for sim- 
plicity, the discussion will use bright- 
ness discrimination as the example. 

A reformulation of the E and I gradi- 
ents hypothesis. If the effective stimu- 
lus is redefined as the ratio of the train- 
ing or test patch to the surround, then it 
is possible to maintain the assumption 
that the animal responds separately to 
the two training stimuli. In this case, 
the appropriate description of the stimu- 
lus intensity dimension will also be ex- 
pressed in such ratios. When an animal 
has been trained to approach one stimu- 
lus (defined in ratio terms) and not to 
approach the other, E and I tendencies 
will be reduced to the degree that test 
ratios change from training ratios. If 
these redefined generalization gradients 
z o, the form postulated by Spence, 

is ypothesis can then account for 
aoe ie and for the decline in 
os acme with changing stimulus 

Can this interpretation account for 
the findings of the other experiments 
that were critical of Spence’s absolute 
hypothesis? Lawrence and DeRivera’s 
experiment is somewhat complex for the 
present analysis, because relationships 
within the stimulus patch were changed 
in the tests, but the surrounding stimu- 
lation was always the same as in train- 
ing. Nevertheless, their results at least 
do not appear inconsistent with this by- 
pothesis. If the two parts of the stimu- 
lus patch are responded to in the way 
that the stimulus surround was responded 
to in the present experiment, then their 
animals learned to jump left when the 
ratio on the cards was in one direction 


and right when it was in the other di- 
rection. As the ratios were changed in 
the test conditions, the tendency to 
make the relational response decreased. 

On the other hand, this reformulation 
of the absolute theory does not account 
for the failure of Baker and Lawrence to 
find transposition following successive 
training. Their experiment therefore 
seems critical, and should be repeated 
with the same pattern of reinforce- 
ments in the successive and simultane- 
ous groups. 

A reformulation of the relational hy- 
pothesis. Since the decline in transpo- 
sition in the far test cannot be consid- 
ered as evidence for an absolute defini- 
tion of the stimulus, the question of the 
adequacy of Kéhler’s original relational 
hypothesis is reopened. In his discus- 
sion of transposition, Köhler merely de- 
scribed the animal as having learned to 
go to the larger or lighter part of the 
stimulus. This restricted theory is still 
inadequate, for the reason pointed out 
by Spence. It cannot account for the 
decline in transposition in the far test. 
Nor does it help Köhler’s explanation 
to say that the figure-ground relation- 
ships determine the phenomenal bright- 
ness of the two test stimuli, and that 
the animal approaches the stimulus 
which is phenomenally brighter, for this 
still would not predict a breakdown in 
transposition in the far test. Another 
unworkable relational assumption is that 
the animal responds to the ratio be- 
tween the two figure-ground ratios. This 
hypothesis would state that in the pres- 
ent experiment the animal responds to 
the relationship of 10/1 (the brighter) 
to 3/1 (the dimmer)—that is, to a re- 
lationship of 10/3 or 3.33. Now in the 
critical far-absolute test, the values are 
about 4,100/1 for the brighter stimulus 
and 1,230/1 for the dimmer. This of 
course reduces to 3.33 and perfect trans- 
position is predicted. Since transposi- 
tion is known to decline under these 
circumstances, this hypothesis is inade- 
quate. 
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It seems clear that a workable rela- 
tional hypothesis will be one that pre- 
dicts perfect transposition as long as all 
stimulus values in the critical dimension 
are multiplied by the same factor. If 
the values in the original training are 
in the ratio of 10/3/1, then any test 
situation in which the ratio is the same 
must give perfect transposition. To the 
extent that any of the three parts of the 
stimulus deviates from this, transposi- 
tion will decline. One possible inter- 
pretation of such behavior would be 
that the animal approaches a stimulus 
that is defined by two intensity ratios, 
and does not approach one that is de- 
fined by two other intensity ratios. 
That is, the animal learns to approach 
a training stimulus which has an in- 
tensity that is a certain multiple of the 
intensity of the surround and a certain 
multiple of the intensity of the other 
training stimulus. Similarly, the ani- 
mal learns not to approach a stimulus 
that has an intensity that is a certain 
multiple of the intensity of the sur- 
round and a certain fraction of the in- 
tensity of the other training stimulus. 
This hypothesis has no trouble in ac- 
counting for a decline in transposition 
in the far-absolute situation of the pres- 
ent experiment. It is not so clear why 
there is no drop in transposition in the 
near-absolute situation, for here there is 
also a change in the ratios, One ad hoc 
interpretation is that, under the usual 
conditions of the transposition experi- 
ment, the near-absolute test is not a 
discriminably different relationship from 
the relationship present during training. 
There are a variety of reasons why this 
might be so. For example, since the 
difference between the training stimuli 


and the surround is substantial, a 


3 Although Köhler explicitly made this point 
when discussing the general problem of trans- 
position (cf. 8, p. 218), he did not apply this 
principle either in the design or in the analy- 
sis of specific experiments. In discussing the 
experiments, only the changes in the test 
patches were considered. 


change of only one step may be rela- 
tively hard to detect. Further, twenty- 
four hours intervene between the last 
training trial and the first test trial. 
This increases the difficulty of discrimi- 
nating between the two relationships. 
This particular relational hypothesis 
is very similar to the reformulated ab- 
solute hypothesis. It differs only in 
that each critical stimulus is defined by 
two brightness gradients rather than 
just one. A direct test between these 
two hypotheses can be made in an ex- 
periment in which the stimulus area sur- 
rounding each stimulus patch may be 
independently varied. Then, following 
training identical to the present experi- 
ment, the ratio of test stimulus to sur- 
round could be kept constant, as in the 
ratio conditions of the present experi- 
ment, but the relationship between the 
two test stimuli could be changed from 
that of the training situation; e.g., the 
stimulus-surround value of the positive 
stimulus might be kept at 10/1, while 
the negative is changed from a training 
value of 3/1 to a test value of 30/10. 
As long as the stimulus-surround ratios 
are constant, the reformulated absolute 
hypothesis should predict that the ani- 
mal would continue to approach the test 
stimulus that has the same ratio as the 
positive training stimulus, even though 
the relationship between the test stimuli 
is changed. This should be true regard- 
less of the amount or direction of the 
change. The Proposed relational hy- 
pothesis would predict breakdown in 
transposition as the ratio of the test 
stimuli to each other becomes more dif- 
ferent from that of the training stimuli, 
even though the stimulus-surround ratios 
are the same as in original training. 
There may be other relational hy- 
potheses that would be consistent with 
the present facts. For example, Teas 
and Bitterman (14) have described 
conditions under which the rat appears 
to respond to the stimulus as a total 
configuration rather than responding se- 
lectively to different components of the 
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stimulus. The only diffculty in apply- 
ing this hypothesis to the present data 
is that this experiment was run under 
conditions that are probably closer to 
the condition in their study that elicited 
selective reactions from the rat. 

Two further implications of the pres- 
ent experiment. It is worth noting that 
the present study has implications not 
only for transposition, but for the closely 
related problem of stimulus generaliza- 
tion along quantitative continua. It ap- 
pears that there is a real possibility that 
if the relationships existing in original 
conditioning are present in generaliza- 
tion tests, stimulus generalization will be 
complete regardless of absolute stimulus 
values. Directly related to this point 
is an experiment by Perkins (12) that 
indicates that stimulus intensity dyna- 
mism is probably a contrast phenome- 
non. 

_ The second point is more of a ques- 
tion than a direct implication of the 
Present experiment. How much of trans- 
Position might be accounted for by sub- 
Ce or even retinal mediation rather 
2 an b cortical processes? Some time 
tence ee presented evidence that 
Bias eon is probably 
subcortically. More recently 
Motokowa (11) has presented evidence 
from retinal recordi TF ornate 
dicating that aE ef ee 
action potentials are re- 


lated to bri 
E ghtness c 
intensity. ontrast rather than 


SUMMARY 


Evidence has been presented that in 
ae discrimination learning, as in 
ea Perception, the effective stimu- 
us to which the animal responds must 
be described as a relationship between 
Parts of a stimulus complex. This find- 
ing allows a reinterpretation of Spence’s 
mae lanation of the decline in transposi- 
ea when test stimuli are remote from 
fain ing stimuli. It also sets the con- 
ditions for a reinterpretation of trans- 
Position. Two alternatives are discussed. 
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THE DOMAIN OF HOMEOSTASIS* 


R. C. DAVIS 


Indiana University 


In the halls of our neighboring physi- 
ology department hangs the celebrated 
motto of Claude Bernard: Le fixité du 
milieu interieur est la condition de la 
vie libre. Of late, in our own discipline, 
as well as in physiology, one may ob- 
serve a spreading and unchallenged faith 
in the virtue of this apothegm and its 
sequel, that all the “vital mechanisms” 
are devoted to the maintenance of this 
stability. It is thought to contain the 
germ which, cultivated by Cannon (3) 
into the full-flowered doctrine of ho- 
meostasis, now offers general shelter for 
psychologists. The “maintenance of 
equilibrium” has often been taken as 
the unifying principle of “motivation,” 
and more recently as fundamental to all 
action of the living (1, 5).? As a model 
for behavior the principle of homeostasis 
presents itself in rivalry with the time- 
honored formula of stimulus-response or 
of any more modern equivalent. 

Clearly this is a possibility of such 
importance that the meaning and do- 
main of “homeostasis” should be given 
careful thought. Homeostasis and adap- 


1 Based on a lecture given at McGill Uni- 
versity and University of Oslo, September and 
October, 1956. Assistance in preparation was 
given by Contract Nonr 908-03, Office of 
Naval Research. 

2The form of Bernard’s principle implies 
that he did not think of it as universal; the 
“free life” seems to be something beyond the 
regulation he is describing. It is very clear 
that Cannon (3) did not think homeostasis 
ruled everything, for he asks “. . . freedom 
for what?” and answers “In summary, then, 
we find the organism liberated for its more 
complicated and socially important tasks be- 
cause it lives in a fluid matrix, which is auto- 

matically kept in a constant condition.” Gen- 
erally he regards the action of skeletal muscles 
as disturbing to homeostasis. 


tation are sometimes treated as though 
one implied the other. But definitions 
of the two terms are usually not thus 
restricted, and the facts do not justify 
an inference of this sort. The Wisdom 
of the Body in both title and content 
shows that Cannon was strongly im- 
pressed with the “adaptiveness” of ho- 
meostatic regulation, and he showed no 
qualms about including purposive clauses 
in his sentences. He observed, never- 
theless, that “. . . at times effects are 
produced which apparently have little 
or no value to the organism,” and that 
infants and lower animals lack many 
homeostatic mechanisms. (Though he 
did not discuss the point, it is doubtful 
that he meant these were ill adapted.) 
The meaning of “adaptation” is none 
too clear in a scientific context, and 
even if it were established that a cer- 
tain event is adaptive, it would follow 
only with an unknown probability that 
it is also homeostatic. 

Two definitions of homeostasis seem 
possible on the basis of current and his- 
torical usage. It is difficult to say which 
is more legitimate, for they were doubt- 
less commingled in the original writings. 
Cannon, and in the earlier period Ber- 
nard, were evidently led to the general 
conception because of some physiologi- 
cal facts which came to light. These 
discoveries showed that certain physio- 
logical variables are more constant than 
one would expect in a changing world, 
and that there are some specific mecha- 
nisms, now called negative feedbacks, 
which act, when a change occurs, in 
such a manner as to minimize the 
change. Either the first or the second 
part of the preceding sentence may be 
taken as the basic meaning of homeosta- 
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sisë In other words, homeostasis can 
mean, and evidently does sometimes 
mean, simply the existence of a certain 
constancy in the face of change. On 
the other hand, one may use the name 
only for a certain type of mechanism. 
In the one case it is an end, in the other 
a means. (Of course, both meanings at 
once may be intended, though such re- 
striction seems unusual.) 

Though often not distinguished, the 
two meanings are not the same. Sta- 
bility, resistance to change, or even res- 
toration of a prior state is not invari- 
ably a sign of a negative feedback. This 
could, no doubt, be demonstrated by a 
mathematical argument, but the point 
can be made as convincingly with less 
formality. 

Evidently the behaving organism is 
one of those physical structures called 

systems.” It qualifies as one of these 
because it has outputs—movements of 
body, hands, vocal cords, etc.—in some 
kind of relation to inputs, some of which 
are usually called stimuli and some of 
which are more general conditions. If 
we follow the usual presumption, this 
relation results from the operation of 
ah aaa laws which apply to 
cal ystems. Accepting this identifi- 
itn ees we may know of sys- 
es & everyday experience as well 
es e mathematically developed 

ty of systems—will come to our aid. 
_ Although one could not hope to de- 
cide exactly what happens on such gen- 
eral grounds, the field of possibilities 
can be helpfully delimited. If, for ex- 


“a wrote “ . . constancy is in itself 
act ra that agencies are acting, or ready to 
tion maintain this constancy.” This impli- 
feedb: goma cover other mechanisms than 
sim a , and indeed he considers “storage” of 
an n capacitative sort an example of such 
Cies, PE a In considering specific constan- 

, however, Cannon apparently takes it for 
granted that the “agency” is a feedback, even 


though th : 8 
Ch. 8). e mechanism is unknown (eS 3 


ample, the principle of conservation of 
energy places certain restrictions on a 
system, it would be an extraordinarily 
bold hypothesis which would propose 
that the organism transcends them. 

In the case at hand—i.e., the organ- 
ism’s stability—we may look at simpler 
systems to find, if we can, the same sort 
of stability. To begin with a very hum- 
ble example, let us reflect on the be- 
havior of an ordinary screen door being 
pushed by a small boy who likes to ex- 
periment. This, too, is a system with 
an input. If he unhooks the spring the 
boy will be pleased to find that the more 
push he gives the farther the door will 
go before it stops. It will be even more 
of a delight to find, after the spring is 
attached, that the door now pushes back 
at him. The more he pushes, the more 
the door reacts, and when he lets go the 
door will return to its original place. 
In more technical words, he has now 
observed that the door has a nonlinear 
response to a step-function input, and 
returns to zero after a square-wave in- 
put. It is displaced less than expected 
by an input force, and has “automatic” 
stabilization at a point of equilibrium. 
By other examples one could show that 
such a system can oscillate (as even a 
screen door can bang twice if conditions 
are right). These kinds of behavior are 
in fact to be found in any system which 
contains a reacting element—such an 
element as produces a restoring force as 
a funtion of input force, and this is a 
very common feature of systems. Re- 
storing forces are provided by springs 
or other elastic bodies, condensers and 
inductors, heat-absorbing elements and 
chemical processes. Some elements of 
this kind are present in all systems, 
and it is a major engineering problem 
to construct a system in which their 
effect is too small to matter. With a 
moment’s thought we can find all of 
these elements, except inductors, in the 
living organism. There is elasticity in 
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all the tissues, and in muscles and blood 
vessels it is known to be important; elec- 
trical and heat capacities exist through- 
out the body, and the piling up of 
chemical energy occurs in many places. 
Clearly then, the “stability” of a re- 
Sponse system is not much of an argu- 
ment for the existence of a feedback 
mechanism. The same effects, to the 
closest approximation we are likely to 
be able to measure, can be brought 
about in a simpler fashion with a sys- 
tem which is reactive. To complete the 
indirect argument one would need to 
exclude in some way the possibility of 
the simpler mechanism in the case at 
hand; still better evidence would come 
from a direct demonstration of negative 
feedback. 
At this point it is interesting to note 
a statistical argument which leads to 
the same conclusion.* Suppose a sys- 
tem can be in a certain number- of 
states, ordered in some respect from 
lowest to highest. For simplicity, then 
suppose that each of these states is 
equally probable and in no way con- 
tingent upon any other. Upon finding 
the system in a state toward the top of 
the scale one would predict that its next 
state, if there is one, will be a lower one, 
simply because there are more states be- 
low than above it in the scale. Their 
summed probabilities will be greater. 
Certainly if the system is already at 
the top of the scale, its next state will 
always be lower, because by definition 
there is nowhere to go but down, and 
the best prediction would be the mean 
of the possible states. If the organism 
is already in its middle state one would, 
of course, predict its next state as a de- 
parture from that, but the mean of the 
possible states would still be the best 
prediction. (If this is not to be counted 
as a “next state,” the best prediction 
would be as close as possible to the 


4This line of thought was’ given me by 
C. J. Burke. 


mean.) Further, suppose that a meas- 
ured property, such as blood pressure, 
is the resultant of a considerable num- 
ber of factors acting according to this 
description. The measured variable then 
tends actually to assume the mean po- 
sition, merely by the random action of 
the many factors. An additional prop- 
erty of the model, in either simple or 
multiple form, is that the probability of 
finding it in a state deviating from the 
mean by a certain value decreases with 
the size of the deviation. (These con- 
sequences will also follow from assump- 


tions more general than the ones used 
for illustration.) 


It is hard to believe that the organ- 
ism is so disjointed as these postulates 
would make it. Yet one may wonder 
whether the laws connecting states of 
the organism are so many that the effect 
is the same after all. 

Inasmuch as stability and equilibra- 
tion are so common in systems, the term 
homeostasis would diffuse over the land- 
scape if it were used in a functional 
sense to specify a certain result, If it 
settles evenly on everything and d 
guishes nothing, then to say t 
system is homeostatic js tautol 
though possibly useful, 

According to its second 
meostasis would be defined as a par- 
ticular sort of Process, regardless of re- 
sult. Identified as a negative feedback 
its basic requirement is that some en- 
ergy is taken from a later part of the 
system and introduced into an earlier 


part in such a way as to oppose the 


change produced there by the original 
input. Certainly quite a number of 
such mechanisms in the organism are 
known; there are others sometimes al- 
leged in rather easy-going interpreta- 
tion, for which one might desire better 
credentials. Physiologists (as well as 
Psychologists) seem sometimes to start 
their interpretations with the postulate 
that every process is homeostatic, 


istin- 
hat a 
logical, 


meaning, ho- 
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If homeostasis as a process is to serve 
as a general model for behavior, we 
should be able to say what information 
there needs to be to decide that a sys- 
tem is homeostatic. From the definition 
it follows that three things are neces- 
sary: (a) We should know what the sys- 
tem is, at least to the extent of identi- 
fying its input and output. (b) We 
should be able to demonstrate that en- 
ergy (“signal”) is, or at least can be, 
carried in a reverse direction. (c) We 
should know that this energy has a 
negative sign with respect to the origi- 
nal process when it re-enters the main 
path of the system. If these three 
things are the meaning of homeostasis 
in the narrower sense, it is plain that 
we need a clear idea of the first and 
a reasonable presumption of the other 
two, lest an assertion of homeostasis be 
but empty words. One should be on 
guard against theorizing which starts 
from one of the three requirements and 
draws upon nothing but a rich vocabu- 
lary for the other two. 

We may contrast two examples of 
supposed homeostatic mechanisms, one 
which seems complete and another 
which seems filled out ad koc. First, 
in the case of the stretch or myotatic 
reflex, we know the system concerned. 
Its output can be taken as the length of 
the muscle. The feedback is initiated 
by the receptors in the muscle and is 
carried backward by the afferents. Its 
effect is negative with respect to muscle 
lengthening. In the well known regula- 
tion of the heart the case is much less 
clear. The known fact is that recep- 
tors in the large arteries are stimulated 
by blood pressure increments and initi- 
ate a reflex which slows the heart rate. 
Then, to fill out the homeostatic pic- 
ture, it is said that this reduces blood 
Pressure. In this instance we collide 
With some troublesome facts. By Star- 
ling’s law, an increased rate would sim- 
Ply reduce the stroke volume and have 


just about no effect on pressure in the 
arteries. And according to Marey’s law, 
another empirical classic, a slow pulse 
is associated with high rather than low 
pressure. To add to the confusion, 
there are receptors in large veins, re- 
sponsive to raised pressure there, which 
increase the heart rate by the so-called 
Bainbridge reflex, and it has been as- 
serted that this too is homeostatic: it 
keeps down the stroke volume of the 
heart. At this point it seems we might 
conclude that the beautiful simplicity of 
homeostatic regulation has been too ap- 
pealing, even in physiology. 

In this instance the trouble seems to 
be an uncertainty about what is being 
regulated by a known mechanism. Of 
course, so long as a process has any 
effect at all, it is always possible to find 
something or invent something which it 
regulates, and in each case one would 
have quite a wide choice. This is a par- 
ticular danger when homeostasis is taken 
as a general model. We may be offered 
variables or entities which exist only to 
be opposed by homeostasis, and con- 
versely homeostatic processes which are 
said to exist only because they are op- 
posite to some known effect, according 
to a sort of Hegelian logic. 

An extreme example of such specula- 
tion, once offered in all seriousness, runs 
as follows: People have a homeostatic 
mechanism for maintaining social esteem, 
or the awareness of it. So, when they 
become aware that they have fallen in 
the opinion of their fellow men, they 
take action to raise their standing until 
their awareness of esteem reaches the 
point of equilibrium. The factual basis 
of this, so far as visible, is that people 
sometimes do things that others approve 
of, and it is doubtful that we know any 
more about this after hearing the theory. 

A process entitled to the name of ho- 
meostasis by meeting the three criteria 
would not necessarily return the organ- 
ism to its undisturbed state. Being a 
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negative feedback it would tend to lessen ` 


departure from such a condition, but 
need not be so powerful as to prevent 
it. The milder sort of feedback would 
tend to “overshooting” and oscillation, 
while admitting some change. Quite 
possibly this sort of feedback is the 
more usual in bodily processes. 

Where we do have homeostasis, in the 
sense of a negative feedback, there are 
implications which would be important 
to a general theory. One of these de- 
serves the emphasis which may be given 
it by a special name: heterostasis. The 
following example shows what the word 
is intended to mean. In humans and 
some other animals body temperature is 
said, quite correctly, to be subject to 
homeostatic regulation. A large part of 
the regulation on the high side, for ani- 
mals like us, is accomplished by the 
evaporation of water secreted through 
the skin, A rise in external tempera- 
ture, or an increase in heat production, 
is followed by a slight rise in internal 
temperature which activates the sweat- 
ing mechanism through a certain well 
known route. And as perspiration is 
evaporated, heat is removed from the 
body and temperature lowered. It seems 
pretty clear that the regulated output of 
the system is temperature. But while 
this is being regulated, what is happen- 
ing to the sweating mechanism? Neces- 
sarily every bit of reduced variation 
in temperature is bought by increased 
variation in sweating. Likewise where 
temperature is regulated during cold, 
there is, on the other side of the pic- 
ture, an increase in heat production. 
This, one may call heterostasis. 

On an argument from the conserva- 
tion of energy, the case of sweating is 
not a special one, but rather is typical. 

In general, the organism, let us say, 
is acted upon by a force of a certain 
energy. The displacement produced by 
that force cannot be neutralized without 
the use of further force, inasmuch as the 
energy of the new state cannot be made 
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to vanish. In the case of homeostasis, 
it is the organism itself which provides 
the restoring energy, by increased sweat- 
ing or shivering, by changes in circu- 
lation, changes in digestion, and ulti- 
mately changes in fuel consumption. In 
cases of changed energy relations be- 
tween the organism and environment, 
then, homeostasis can be maintained for 
one variable only at the expense of 
heterostasis in at least one other. 

The chief homeostatic systems, like 
that for temperature regulation, seem to 
have feedback circuits with rather short 
loops, turning back from a point near 
the output and opposing the system re- 
sponse at some point not much earlier. 
Yet there are some feedbacks, perhaps 
longer ones, which regulate the admis- 
sion of energy to the organism. Such 
are the tympanic and iridic reflexes, and 
possibly certain mechanisms which af- 
fect the sensitivity of afferent nerves, 
By operating at the input these stabilize 
the whole system to some degree, and 
heterostasis is not so obvious. Never- 
theless one must recognize that it exists. 
Because the system operates as an am- 
plifier, introducing energy along the way, 
it is less costly to oppose system re- 
sponse at an early stage. Yet even here 
a certain amount of work must be per- 
formed: the tympanic and iridic muscles 
do, of course, consume energy which 
they would not if they were not regu- 
lating input. Their action, therefore, 
shows greater variation while the main 
auditory and visual channels are show- 
ing less. 

The point seems general that an or- 
ganism’s constancy in one aspect im- 
plies variability in another. It is speak- 
ing rather loosely, therefore, to say that 
an organism as a whole maintains an 
internal constancy in spite of external 
variations. Rather, some features of its 
internal state are kept constant by ho- 
meostasis, some by simpler means, and 
some vary in compensatory fashion. 

There is one final argument for the 
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generality of homeostasis to be weighed: 
the argument from its necessity for sur- 
vival. If survival were absolute, the 
argument would be more compelling, for 
it would be difficult to see how any- 
thing but an ad hoc mechanism could 
counter all the vagaries of the environ- 
ment. But organisms and species are 
both mortal, and it seems quite possible 
that their limited survival can occur 
without their having the best of im- 
aginable mechanisms for its assurance. 
It is hard to say just what life span 
would be permitted by the expected 
combination of good and bad accidents. 

So far the argument has concerned 
cases in which there is a tendency to- 
ward equilibrium somewhere. But there 
are physiological mechanisms which work 
toward the opposite end, lowering the 
threshold for an activity which is al- 
ready under way. There is, for exam- 
ple, the facilitation effect described by 
Gellhorn and his collaborators (2): a 
muscular activity will lower the cortical 
threshold for more of the same activity. 
On the neuronal level it is well known 
that a large depolarization (spike po- 
tential) is more easily produced in a 
tissue which already has a small one 
(local bio-electric disturbance). Prior 
stimulation of the muscle spindles over 
the “gamma system” increases their sus- 
ceptibility to later stimuli (4). Some 
of these positively accelerated phe- 
nomena depend on positive feedback, 
and others do not.’ In organismic be- 
havior there are likewise events, such 
as perseveration, which suggest a posi- 
tive feedback. In general, a reinforce- 
ment effect, contingent upon an action 
and leading to its repetition, fits quite 
well the model of positive feedback. 
Learning may thus be represented by 
such a model. Still more generally it 


5A. Lundervold called my attention to the 
frequent occurrence of positive feedback in 
Neural processes. 


is quite usual that a response is far more 
energetic than its stimulus. Instead of 
resisting a sensory disturbance, the or- 
ganism amplifies it. While this is not 
necessarily a positive feedback, it is 
often hard to see that it is equilibration, 
either. Possibly the application of the 
activating and positive feedback models 
should be explored more than it has 
been. 

Returning to the original question, 
the answer is that homeostasis would 
not serve as a good all-around model for 
behavior. Using the word in a very 
broad functional sense will merely en- 
large the vocabulary beyond necessity. 
If we use it in the narrow sense of a 
negative feedback mechanism, we find 
that the model does not, and indeed 
cannot, fit all the facts. We can make 
it conform by adding a series of inven- 
tions to the facts observed, but there 
seems no reason for doing so. Ho- 
meostasis exists, of course, with respect 
to certain variables, and one should try 
to find out what these are. But it is to 
be understood as a special case of the 
more general conception of the response 
of systems to inputs. There is no com- 
pulsion to think that the organism is an 
elaborate machine for the purpose of 
getting itself back to the status quo 


s! 
ante, or, indeed, for any other purpose. 
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HOW SHALL A THING BE CALLED? 
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The most deliberate part of first-lan- 
guage teaching is the business of telling 
a child what each thing is called. We 
ordinarily speak of the name of a thing 
as if there were just one, but in fact, of 
course, every referent has many names. 
The dime in my pocket is not only a 
dime. It is also money, a metal object, 
a thing, and, moving to subordinates, it 
is a 1952 dime, in fact a particular 1952 
dime with a unique pattern of scratches, 
discolorations, and smooth places. When 
such an object is named for a very young 
child how is it called? It may be named 
money or dime but probably not metal 
object, thing, 1952 dime, or particular 
1952 dime. The dog out on the lawn is 
not only a dog but is also a boxer, a 
quadruped, an animate being; it is the 
landlord’s dog, named Prince, How will 
it be identified for a child? Sometimes 
it will be called a dog, sometimes Prince, 
less often a boxer, and almost never a 
quadruped, or animate being. Listening 
to many adults name things for many 
children, I find that their choices are 
quite uniform and that I can anticipate 
them from my own inclinations. How 
are these choices determined and what 
are their consequences for the cognitive 
development of the child? 

Adults have notions’ about the kind of 
language appropriate for use with chil- 
dren. Especially strong and universal 
is the belief that children have trouble 

pronouncing long names and so should 
always be given the shortest possible 
names. A word is preferable to a phrase 
and, among words, a monosyllable is 
better than a polysyllable. This pre- 
dicts the preference for dog and Prince 
over boxer, quadruped, and animate be- 


ing. It predicts the choice of dime over 
metal object and particular 1952 dime. 

Zipf (10) has shown that the length 
of a word (in phonemes or syllables) 
is inversely related to its frequency in 
the printed language. Consequently the 
shorter names for any thing will usu- 
ally also be the most frequently used 
names for that thing, and so it would 
seem that the choice of a name is usu- 
ally predictable from either frequency 
or brevity. The monosyllables dog and 
Prince have much higher frequencies ac- 
cording to the Thorndike-Lorge list (8) 
than do the polysyllables boxer, quad- 
ruped, and animate being, 

It sometimes happens, however, that 
the frequency-brevity principle makes 
the wrong prediction. The thing called 
a pineapple is also fruit. Fruit is the 
shorter and more frequent term, but 
adults will name the thing pineapple. 
Similarly they will say apple, banana, 
orange, and even pomegranate; all of 
them longer and less frequent words 
than the perfectly appropriate fruit. 
Brevity seems not to be the powerful 
determinant we had imagined. The fre- 
quency principle can survive this kind 
of example, but only if it js separated 
from counts like the Thorndike-Lorge of 
over-all frequency in the printed lan- 
guage. On the whole the word fruit 
appears more often than the word pine- 
apple (and also is shorter), but we may 
confidently assume that, when pineap- 
ples are being named, the word pine- 
apple is more frequent than the word 
fruit. This, of course, is a kind of fre- 
quency more directly relevant to our 
problem. Word counts of general usage 
are only very roughly applicable to the 
prediction of what will be said when 
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something is named. What we need is 
referent-name counts. We don’t have 
them, of course, but if we had them it 
is easy to see that they would improve 
our predictions. Bananas are called ba- 
nana, apples apple, and oranges orange 
more often than any of them is called 
fruit. The broad frequency-brevity prin- 
ciple predicts that money and dime will 
be preferred to metal object, 1952 dime, 
and particular 1952 dime, but it does 
not predict the neglect of the common 
monosyllable thing. For this purpose 
we must again appeal to imagined re- 
ferent-name counts, according to which 
dimes would surely be called dime or 
money more often than thing. 

While the conscious preference for a 
short name can be overcome by fre- 
quency, the preference nevertheless af- 
fects the naming act. I have heard 
parents designate the appropriate ob- 
jects pineapple, televsion, vinegar, and 
policeman; all these to children who 
cannot reproduce polysyllabic words. 
Presumably they use these names be- 
cause that is what the referents are 
usually called, but the adult’s sense of 
the absurdity of giving such words to 
a child is often evident. He may smile 
as he says it or remark, “That’s too 
hard for you to say, isn’t it?” 

Some things are named in the same 
way by all adults for all children. This 
is true of the apple and the orange. 
Other things have several common 
names, each of them used by a speci- 
fiable group of adults to specifiable chil- 
dren. The same dog is dog to most of 
the world and Prince in his own home 
and perhaps on his own block. The 
Same man is a man to most children, 
policeman to some at some times, Mr. 
Jones to the neighborhood kids, and 
Papa to his own. Referent-name counts 
from people in general will not predict 
these several usages. A still more par- 
ticular name count must be imagined. 
The name given a thing by an adult for 


a child is determined by the frequency 
with which various names have been ap- 
plied to such things in the experience of 
the particular adult. General referent- 
name counts taken from many people 
will predict much that the individual 
does, but, for a close prediction, counts 
specific to the individual would be 
needed. 

The frequencies to which we are now 
appealing have not, of course, been re- 
corded. We are explaining imagined 
preferences in names by imagined fre- 
quencies of names. It is conceivable, 
certainly, that some of these specific 
word counts might be made and a fu- 
ture naming performance independently 
predicted from a past frequency. Prob- 
ably, however, such frequencies will 
never be known, and if we choose to 
explain particular naming performances 
by past frequencies we shall usually 
have to infer the frequency from the 
performance. 


BEYOND THE FREQUENCY PRINCIPLE 


A frequency explanation is not very 
satisfying even when the appeal is to 
known frequencies. The question will 
come to mind: “Why is one name more 
common than another?” Why is a dog 
called dog more often than quadruped 
and, by some people, called Prince more 
often than dog? Perhaps it just hap- 
pened that way, like driving on the 
right side of the road in America and 
on the left in England. The convention 
is preserved but has no justification out- 
side itself. As things have worked out, 
coins are usually named by species as 
dime, nickel, or penny while the peo- 
ple we know have individual names like 
John, Mary, and Jim. Could it just as 
easily be the other way around? Might 
we equally well give coins proper names 
and introduce people as types? 

The referent for the word dime is a 
large class of coins. The name is equally 
appropriate to all members of this class. 
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To name a coin dime is to establish its 
equivalence, for naming purposes, with 
all other coins of the same denomina- 
tion. This equivalence for naming pur- 
poses corresponds to a more general 
equivalence for all purposes of economic 
exchange. In the grocery one dime is as 
good as another but quite different from 
any nickel or penny. For a child the 
name given an object anticipates the 
equivalences and differences that will 
need to be observed in most of his deal- 
ings with such an object. To make 
proper denotative use of the word dime 
he must be able to distinguish mem- 
bers of the referent category from every- 
thing else. When he learns that, he has 
solved more than a language problem. 
He has an essential bit of equipment 
for doing business. The most common 
names for coins could not move from 
the species level to the level of proper 
names without great alteration in our 
nonlinguistic culture. We should all be 
numismatists preparing our children to 
recognize a particular priceless 1910 
dime. 
Many things are reliably given the 
same name by the whole community. 
The spoon is seldom called anything but 
spoon, although it is also a piece of 
silverware, an artifact, and a particular 
ill-washed restaurant spoon. The com- 
munity-wide preference for the word 
Spoon corresponds to the community- 
wide practice of treating spoons as 
equivalent but different from knives and 
forks. There are no proper names for 
individual spoons because their indi- 
viduality seldom signifies. It is the 
same way with pineapples, dimes, doors, 
and taxicabs. The most common name 
for each of these categorizes them as 
they need to be categorized for the com- 
munity’s nonlinguistic purposes. The 
most common name is at the level of 
usual utility. anes 
People and pets have individual names 
as well as several kinds of generic name. 
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The individual name is routinely coined 
by those who are disposed to treat the 
referent as unique, and is available after- 
wards to any others who will see the 
uniqueness. A man at home has his 
own name to go with the peculiar privi- 
leges and responsibilities binding him to 
wife and child. But the same man who 
is a one-of-a-kind papa to his own chil- 
dren is simply a man to children at 
large. He is, like the other members 
of this large category, someone with no 
time to play and little tolerance for 
noise. In some circumstances, this same 
man will be given the name of his occu- 
pation. He is a policeman equivalent 
to other policemen but different from 
bus drivers and Good Humor men. A 
policeman is someone to “behave in 
front of” and to go to when lost. To 
the kids in the neighborhood the man 
is Mr. Jones, unique in his way—a 
crank, bad tempered, likely to shout at 
you if you play out in front of his 
house. It is the same way with dogs 
as with people. He may be a unique 
Prince to his owners, who feed and 
house him, but he is just a dog to the 
rest of the world. A homeless dog 
reverts to namelessness, since there is 
none to single him out from his species. 
Dimes and nickels have much the same 
significance for an entire society, and 
their usual names are fixed at this level 
of significance. People and pets func- 
tion uniquely for some and in various 
generic ways for others, They have a 
corresponding variety of designations, 
but each name is at the utility level for 
the group that uses it. Our naming 
practices for coins and people corre- 
spond to our nonlinguistic practices, and 
it is difficult to imagine changing the 
one without changing the other. 

The names provided by parents for 
children anticipate the functional struc- 
ture of the child’s world.* This is not, 


1 The equivalence of dimes and their distinc- 
tiveness as a class from nicia and pennies 
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of course, something parents are aware 
of doing. When we name a thing there 
does not seem to be any process of 
choice. Each thing has its name, just 
one, and that is what we give to a child. 
The one name is, of course, simply the 
usual name for us. Naming each thing 
in accordance with local frequencies, 
parents unwittingly transmit their own 
cognitive structures. It is a world in 
which Prince is unique among dogs and 
papa among men, spoons are all alike 
but different from forks. It may be a 
world of bugs (to be stepped on), of 
flowers (not to be picked), and birds 
(not to be stoned). It may be a world 
in which Wiggers, like spoons, are all of 
a kind. A division of caste creates a 
vast categorical equivalence and a cor- 

. respondingly generic name. Mr. Jones 
and Mr. Smith do not come out of racial 
anonymity until their uniqueness is ap- 
preciated. 

Adults do not invariably provide a 
child with the name that is at the level 
of usual utility in the adult world. An 
effort is sometimes made to imagine the 
utilities of a child’s life. Some parents 
will, at first, call every sort of coin 
money. This does not prepare a child 
to buy and sell, but then he may be too 
young for that. All coins are equiva- 


is strongly suggested by the appearance of 
individual coins as well as by their names. 
Variations in size, weight, and hue are far 
Greater between classes than within a class. 
This, of course, is because coins are manufac- 
tured in accordance with a categorical scheme 
which is also represented in our names for 
coins. It is possible, then, that a child might 
Structure coins in the culturally approved 
manner if he never heard them named at all. 

Owever, we cannot be sure that an untutored 
child would not put all shiny new coins into 
One class and all the dingy specimens into an- 
other. When the referents are not manufac- 
tured articles but are such things as dogs, peo- 
ple, flowers, and insects, it is clear that au- 
tochthonous factors in perception do not force 
any single scheme of categorization. The 
names applied must be the child’s principal 
clue to the lonely functioning scheme. 


lent for the very young child in that 
they are objects not to be put into 
the mouth and not to be dropped down 
the register, and money anticipates that 
equivalence. A more differentiated ter- 
minology can wait upon the age of store- 
going. Sometimes an adult is aware of 
a child’s need for a distinction that is 
not coded in the English lexicon. A 
new chair comes into the house and is 
not going to be equivalent to the shabby 
chairs already there. A child is per- 
mitted to sit on the old chairs but will 
not be permitted on the new one. A 
distinctive name is created from the 
combinational resources of the language. 
The new chair or the good chair is not 
to be assimilated to chairs in general. 

Eventually, of course, children learn 
many more names for each thing than 
the one that is most frequent and use- 
ful. Sometimes a name is supplied in 
order to bring forward an immediately 
important property of the referent. A 
child who starts bouncing the coffee pot 
needs to be told that it is glass. Some- 
times a name is supplied to satisfy the 
child’s curiosity as to the place of a 
referent in a hierarchy of categories. 
Chairs are furniture and so are tables; 
carrots are a vegetable but apples are 
not. Probably, however, both children 
and adults make some distinction among 
these various names. The name of a 
thing, the one that tells what it “really” 
is, is the name that constitutes the refer- 
ent as it needs to be constituted for most 
purposes. The other names represent 
possible recategorizations useful for one 
or another purpose. We are even likely 
to feel that these recategorizations are 
acts of imagination, whereas the major 
categorization is a kind of passive rec- 
ognition of the true character of the 
referent. 


THE CHILD’S CONCRETE VOCABULARY 


It is a commonplace saying that the 
mind of a child is relatively “concrete” 
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and the mind of an adult “abstract.” 
The words “concrete” and “abstract” 
are sometimes used in the sense of 
subordinate and superordinate. In this 
sense a relatively concrete mind would 
operate with subordinate categories and 
an abstract mind with superordinate 
categories. It is recorded in many 
studies of vocabulary acquisition (e.g., 
2, 6) that children ordinarily use the 
words milk and water before the word 
liquid; the words apple and orange be- 
fore fruit; table and chair before furni- 
ture; mamma and daddy before parent 
or person; etc. Very high-level super- 
ordinate terms like article, action, qual- 
ity, and relation, though they are com- 
mon in adult speech (8), are very sel- 
dom heard from preschool children (2). 
Presumably this kind of vocabulary 
comparison is one of the sources of the 
notion that the child’s mind is more 
concrete than the mind of the adult.? 


2From the facts of vocabulary acquisition 
alone it is not possible to draw safe conclu- 
sions about cognitive development. Such con- 
clusions rely on something like the following 
set of assumptions. A subject, whether ani- 
mal or human, is ordinarily credited with a 
cognitive category when he extends some dis- 
tinctive response to new instances of the cate- 
gory and withholds it from noninstances. 
Words, when used to denote new referents, 
are such a distinctive response. If children 
speak words they probably can make correct 
denotative use of them, and so the presence 
of the word in a child’s vocabulary may be 
taken as evidence that he Possesses the cate- 
gory to which the word makes reference. The 
instances of the category are presumed not to 
be differentiated by the child unless he uses 
words for such differentiations. If all of these 
assumptions are made it would seem to follow 
that the direction of vocabulary growth (from 
subordinate to superordinate or vice versa) 
reveals the direction of cognitive development. 
When the assumptions of such an argument 
are explicitly stated, it is clear that they are 
too many and too doubtful. Obviously words 
may be spoken but not understood ; objects 
may be differentiated by nonlinguistic response 
even though they are not differentiated lin- 
guistically. However, it is not my purpose 
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However, the vocabulary of a child is 
not a very direct index of his cognitive 
preferences. The child’s vocabulary is 
more immediately determined by the 
naming practices of adults. 

The occasion for a name is ordinarily 
some particular thing. In the naming 
it is categorized. The preference among 
possible names seems to go to the one 
that is most commonly applied to the 
referent in question. That name will 
ordinarily categorize the referent so as 
to observe the equivalences and differ- 
ences that figure in its usual utilization. 
There are not many purposes for which 
all liquids are equivalent or all fruits, 
furniture, or parents; and so the names 
of these categories are less commonly 
used for denotation than are the names 
of categories subordinate to them. It 
is true that words like article, action, 
quality and relation are rather common 
in adult written English, but we can be 
sure that these frequencies in running 
discourse are not equaled in naming 
situations. Whatever the Purposes for 
which all articles are equivalent, or all 
actions or qualities, they are not among 
the pressing needs of children. 

It is not invariably true that vocabu- 
lary builds from concrete to abstract. 
Fish is likely to be learned before perch 
and bass; house before bungalow and 
mansion; car before Chevrolet and Ply- 
mouth (6). The more concrete vocabu- 
lary waits for the child to reach an age 
where his purposes differentiate kinds of 
fish and makes of cars. There is much 
elaborately concrete vocabulary that is 
not introduced until one takes courses 
in biology, chemistry, and botany. No 
one has ever proved that vocabulary 
builds from the concrete to the abstract 


here to quarrel with these assumptions but 
rather to show that, even when they are ac- 
cepted, the facts of vocabulary growth do not 
compel the conclusion that cognitive develop- 
ment is from the concrete to the abstract. 


How SHALL A THING Be CALLED? 19 


more often than it builds from the ab- 
stract to the concrete. The best gen- 
eralization seems to be that each thing 
is first given its most common name. 
This name seems to categorize on the 
level of usual utility. That level some- 
times falls on the most concrete cate- 
gories in a hierarchy (proper names for 
significant people), and vocabulary then 
builds toward the more abstract cate- 
gories (names for ethnic groups, per- 
sonality types, social classes). Utility 
sometimes centers on a relatively ab- 
stract level of categorization (fish) and 
vocabulary then builds in both direc- 
tions (perch and vertebrate). Probably 
utility never centers on the most ab- 
stract levels (thing, substance, etc.), 
and so probably there is no hierarchy 
within which vocabulary builds in an 
exclusively concrete direction. 

In the literature describing first-lan- 
guage acquisition (5) there is much to 
indicate that children easily form large 
abstract categories. There are, to be- 
gin with, the numerous cases in which 
the child overgeneralizes the use of a 
conventional word. The word dog may, 
at first, be applied to every kind of 
four-legged animal. It sometimes hap- 
pens that every man who comes into 
the house is called daddy. When chil- 
dren invent their own words, these often 
have an enormous semantic range. Wil- 
helm Stern’s (7) son Giinther used psee 
for leaves, trees, and flowers. He used 
bebau for all animals. Lombroso (9) 
tells of a child who used gua qua for 
both duck and water and afta for drink- 
ing glass, the contents of a glass, and 
a pane of glass. Reports of this kind 
do not suggest that children are defi- 
Clent in abstracting ability. It even 
looks as if they may favor large cate- 
8ories, 

There are two extreme opinions about 
the direction of cognitive development. 
There are those who suppose that we 


begin by discriminating to the limits of 
our sensory acuity, seizing each thing 
in its uniqueness, noting every hair and 
flea of the particular dog. Cognitive 
development involves neglect of detail, 
abstracting from particulars so as to 
group similars into categories. By this 
view abstraction is a mature rather than 
a primitive process. The contrary opin- 
ion is that the primitive stage in cog- 
nition is one of a comparative lack of 
differentiation. Probably certain distinc- 
tions are inescapable; the difference be- 
tween a loud noise and near silence, 
between a bright contour and a dark 
ground, etc. These inevitable discrimi- 
nations divide the perceived world into 
a small number of very large (abstract) 
categories. Cognitive development is in- 
creasing differentiation. The more dis- 
tinctions we make, the more categories 
we have and the smaller (more con- 
crete) these are. I think the latter view 
is favored in psychology today. While 
there is good empirical and theoretical 
support (1, 3, 4) for the view that de- 
velopment is differentiation, there is em- 
barrassment for it in the fact that much 
vocabulary growth is from the concrete 
to the abstract. This embarrassment 
can be eliminated. 

Suppose a very young child applies 
the word dog to every four-legged crea- 
ture he sees. He may have abstracted 
a limited set of attributes and created 
a large category, but his abstraction will 
not show up in his vocabulary. Parents 
will not provide him with a conventional 
name for his category, e.g., quadruped, 
but instead will require him to narrow 
his use of dog to its proper range. Sup- 
pose a child calls all elderly ladies aunt. 
He will not be told that the usual name 
for his category is elderly ladies but, in- 
stead, will be taught to cut back aunt 
to accord with standard usage. In 
short, the sequence in which words are 
acquired is set by adults rather than 
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children, and may ultimately be deter- 
mined by the utility of the various cate- 
gorizations. This will sometimes result 
in a movement of vocabulary toward 
higher abstraction and sometimes a 
movement toward greater concreteness. 
The cognitive development of the child 
may nevertheless always take the direc- 
tion of increasing differentiation or con- 
creteness. 

The child who spontaneously hits on 
the category four-legged animals will be 
required to give it up in favor of dogs, 
cats, horses, cows, and the like. When 
the names of numerous subordinates 
have been mastered, he may be given 
the name quadruped for the superordi- 
nate. This abstraction is not the same 
as its primitive forerunner. The school- 
boy who learns the word quadruped 
has abstracted from differentiated and 
named subordinates. The child he was 
abstracted through a failure to differ- 
entiate. Abstraction after differentia- 
tion may be the mature process, and 
abstraction from a failure to differenti- 
ate the primitive. Needless to say, the 
abstractions occurring on the two levels 
need not be coincident, as they are in 
our quadruped example. 


SUMMARY 


Though we often think of each thing 
as having a name—a single name—in 
fact, each thing has many equally cor- 
rect names. When some thing is named 
for a child, adults show considerable 
regularity in their preference for one of 
the many possible names. This paper 
is addressed to the question: “What de- 
termines the name given to a child for 
a thing?” The first answer..is that 
adults prefer the shorter to the longer 
expression, This gives way to the fre- 


quency principle. Adults give a thing - 


the name it is most commonly given. 
We have now come full circle and are 
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left with the question, “Why is one 
name for a thing more common than 
another?” 

It seems likely that things are first 
named so as to categorize them in a 
maximally useful way. For most pur- 
poses Referent A is a spoon rather than 
a piece of silverware, and Referent B a 
dime rather than a metal object. The 
same referent may have its most useful 
categorization on one level (Prince) for 
one group (the family) and on another 
level (dog) for another group (stran- 
gers). The categorization that is most 
useful for very young children (money) 
may change as they grow older (dime 
and nickel). 

With some hierarchies of vocabulary 
the more concrete terms are learned be- 
fore the abstract; probably the most ab- 
stract terms are never learned first, but 
it often happens that a hierarchy de- 
velops in both directions from a middle 
level of abstraction. Psychologists who 
believe that mental development is from 
the abstract to the concrete, from a 
lack of differentiation to increased dif- 
ferentiation, have been embarrassed by 
the fact that vocabulary often builds in 
the opposite direction. This fact need 
not trouble them, since the sequence in 
which words are acquired is not deter- 
mined by the Cognitive preferences of 


children so much as by the naming prac- 
tices of adults. 
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“WHAT IS LEARNED?”—AN EMPIRICAL ENIGMA? 
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Although cognitive and stimulus-re- 
sponse interpretations of learning have 
traditionally been regarded as antipodes 
in the theoretical disputes of behavioral 
psychology, it has become increasingly 
apparent that there are few, if any, 
predictive behavioral differences between 
these positions. Superficially, one might 
expect gross disagreement between an 
S-R theory, which holds to a direct at- 
tachment of responses to stimuli, and a 
cognitive or “expectancy” theory which 
maintains that conditioned stimuli elicit 
Sensory processes, or surrogates thereof, 
previously associated with other stimuli. 
Yet when one reflects that the concept 
of “mediation response” (27) fan- 
ticipatory goal response” (15) aloes a 
stimulus to evoke a CR whose proprio- 


ceptive feedback provides a mechanism: 


by which a stimulus may have condi- 
tioned sensory consequences, it is no 
longer surprising that no one has yet 
succeeded in cooking up a “critical” ex- 
periment to settle the controversy. 

It is entirely too rash, however, to 
conclude with Kendler (20) that specu- 
lations as to “what is learned” are em- 
pirically meaningless. To do so is to 
overlook the crucial point that the known 
—or supposed—facts of behavior have 
driven S-R theorists from their older 
models without mediation responses to a 
more complex theoretical analysis which 
is, within rather wide limits, formally 


1 This article is based upon a dissertation 
submitted to the Faculty of the Division of 
the Biological Sciences, the University of Chi- 
cago, in partial fulfillment of the requirements 
for the Ph.D. degree. The writer wishes to ex- 
press his gratitude for the generous support 
of the National Science Foundation, whose 
fellowship program has subsidized the writer’s 
graduate studies. 
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identical with expectancy theory. There 
are important predictive differences be- 
tween the old, unmediated, S-R theory 
on the one hand, and mediated S-R or 
expectancy theory on the other. Hence, 
if by “What is learned?” we are con- 
cerned not with the connotative imagery 
associated with theoretical terms but 
with the formal properties of learning 
theories, it would appear that this ques- 
tion does have empirical significance. 
The fact that behavior theories of widely 
divergent conceptual origins appear rap- 
idly to be converging to a common for- 
mal structure strongly implies the exist- 
ence of certain empirical principles of 
behavior which necessitate this particu- 
lar structure. And if such is indeed the 
case, we are confronted with a problem: 
Exactly what are the empirical principles 
which give rise to mediational theories 
of learning? 

The question just posed is not as sim- 
ple as it might appear. Every graduate 
student of behavioristics is familiar with, 
eg., the “latent learning” experiments 
which were supposedly critical for ex- 
pectancy theories, and whose explana- 
tions in S-R terms draw so heavily on 
mediation responses. But an experi- 
ment is not an empirical principle; by 
the latter, we understand a lawful rela- 
tion among observation variables which, 
in conjunction with other empirical prin- 
ciples, determines the outcome of an ex- 
periment. To understand the empirical 
bases for mediation hypotheses, it is 
insufficient to know what experimental 
outcomes favor a mediation hypothesis. 
We must also know the laws which these 
results exemplify. 

Let me elaborate on this point, for it 
is an important one. There appear to 
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be two main approaches through which 
one can seek understanding of natural 
phenomena. One—the theoretic—is to 
hypothesize about the underlying unob- 
served entities or processes and their re- 
lations which are casually responsible for 
the gross observed effects. The other— 
the empiristic—is to seek discovery of a 
set of dependable empirical covariations, 
in terms of which more complex phe- 
nomena may then be analyzed. There 
is no need here to weigh the relative 
merits of these approaches.? What is 
important to note is that there is no rea- 
son to doubt that any behavioral phe- 
nomenon can be analyzed as an instance 
of one or more purely empirical prin- 
ciples, even though these, in turn, may 
call for explanation in theoretical terms. 
The importance of such “brute facts” of 
behavior for empiricist and theorist alike 
1s appreciated readily enough by reflec- 
tion on the position in behavioral psy- 
chology of the empirical laws of condi- 
tioning, both classical and instrumental, 
the facts about deprivation and activity 
level, etc. Such principles may be re- 
garded as what we know about behavior, 
in contrast to what we surmise (though, 
of course, in the last analysis this is less 
an absolute distinction than a matter of 
degree), and it is this hard core of facts 
—by which I mean not merely data, 
which are gathered readily enough, but 
the much less obvious empirical covaria- 
tions which these exemplify—upon which 
the science of behavioristics is erected. 

The question which I have raised is 
whether the problem, “What is learned?”, 
is not reflected on the empirical level as 
a question about the laws of behavior. 
I have already argued that the struc- 
tural convergence of alternate learning 
theories does, in fact, suggest this to be 


th dt, space permitted, I would argue that 
‘eoretical concepts are—within sharply cir- 
Cumscribable limits—not merely justified, but 
are more or less compelled by certain kinds of 
empirical data. 


so. The question still remains, however, 
as to whether the particular relations re- 
sponsible for mediation hypotheses are 
merely special instances of laws already 
known, or whether, perhaps, a previously 
unrecognized basic principle of behavior 
is involved. I shall attempt to demon- 
strate that the latter is, indeed, the case 
—or more accurately, that the empirical 
basis for mediation hypotheses is a fun- 
damental behavioral principle 7f, in fact, 
it exists. For, as will be shown, the 
problem at hand carries us into a new, 
essentially unexplored, dimension of re- 
search. 

I shall not attempt to display the em- 
pirical problem of “What is learned?” 
through analysis of the traditional latent 
learning and reasoning experiments. To 
do so would be to become lost in a mass 
of obscuring details, and to suggest that 
the principle for which we are seeking 
manifests itself only in special circum- 
stances. Rather, I shall demonstrate 
that the empirical possibility which gives 
rise to mediation theories of learning is 
an intrinsic aspect of each and every in- 
stance of conditioning phenomena. The 
known facts of conditioning are tradi- 
tionally phrased in a way that conceals 
a highly critical ambiguity concerning 
what are the facts of conditioning. In 
the next section, I shall attempt to for- 
mulate the conditioning paradigm in 
complete generality, and to exhibit the 
correlated empirical mediation problem, 
without whose answer a resume of the 
facts of conditioning is as incomplete as 
a specification of geographical location 
which gives latitude but not longitude. 
But to illustrate my point and to illumi- 
nate the general analysis, I will first cite 
a specific example. 

Perhaps the oldest empirical principle 
of behavioristics is that of classical con- 
ditioning. This may be expressed: 

(I) If an organism repeatedly en- 
counters, under suitable circumstances, @ 
stimulus, Se, immediately preceding a 
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second stimulus, Sy, which evokes re- 


sponse R, then S, will come to evoke R.. 


With some qualifications (chiefly in 
regard to the similarity of CR and UR), 
(I) may stand as a close paraphrase 
of conventional formulations of classical 
conditioning, and seems to assert no 
more than is justified by generalization 
from known data. But compare it with 
the following formulation: 

(II) If an organism repeatedly en- 
counters, under suitable circumstances, 
a stimulus, Se, immediately preceding a 
second stimulus, Sy, then Se will come to 
evoke the same response(s) as elicited 
by Su. 

Both (I) and (II) are equally sup- 
ported by the facts of conditioning, so 
far as these are known; yet (1) implies 
that classical conditioning establishes a 
bond between S. and R which is inde- 
pendent of the subsequent behavioral ef- 
fects of Sy, whereas (II) implies that 
any modifications in the response-evoca- 
tion properties of Sy will be passed along 
to S.—i.e., that classical conditioning es- 
tablishes a functional equivalence be- 
tween conditioned and unconditioned 
stimulus. Since the behavioral effects 
of the US have almost invariably been 
left unaltered in conditioning experi- 
ments, there is absolutely no justifica- 
tion for assuming (I) to be any more 
representative of the empirical facts 
than (II). Yet it is traditionally as- 
sumed by empiricists and S-R theorists 
that the conditioned reflex is a “direct” 
or “mechanical” attachment of response 
to stimulus—i.e., that (I) is the case. 
The truth is that we simply don’t know 
what the relationship is between the re- 
sponses to the CS and US subsequent to 
the conditioning operation, despite the 
fact that the value of this information, 
both for theoretical and applied pur- 

poses, can scarcely be overestimated. 

The moral that I want to draw is that 

our knowledge of the empirical facts of 
classical conditioning is incomplete—not 
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in the relatively trivial sense that we 
are not yet acquainted with all the pa- 
rameters of its occurrence, but in the 
extremely basic sense that we have only 
partial knowledge of its empirical conse- 
quences. Moreover, unless we explicitly 
stipulate that the behavioral effects of 
the US remain constant, it appears ex- 
tremely difficult to express what we do 
know about conditioning without inad- 
vertently prejudging the results of ex- 
periments yet to be conducted. (Such 
prejudgments undoubtedly account for 
our traditional assumption that a condi- 
tioned reflex, once established, is inde- 
pendent of the subsequent effects of the 
US.) But the relation between the re- 
sponse-evocation properties of the CS 
and those of the US constitutes an em- 
pirical principle yet to be determined, 
and it is the likelihood that a significant 
relationship does in fact exist, along 
with others of its kind, that has encour- 
aged the development of mediational 
theories of learning. 


THE GENERALIZED CONDITIONING 
PARADIGM 


The preceding example demonstrates 
how, for classical conditioning, the prob- 
lem, “What is learned?”, is not pri- 
marily a theoretical issue at all, but a 
straightforward query about the brute 
facts of behavior. As we shall now see, 
a similar problem exists for every form 
of conditioning. To show this in com- 
plete generality, it is first expedient to 
assimilate the various instances of con- 
ditioning—conditioned reflex, instrumen- 
tal conditioning, conditioned reinforce- 
ment, etc.—under a single rubric, 

Most, if not all, of those instances of 
learning to which the term “condition- 
ing” has been applied, as well as fore- 
seeable extensions of this term, may be 
subsumed under the formula of a stimu- 
lus, Se, acquiring certain behavioral ef- 
fects (for a given organism) through 
participation of S, in certain contin- 
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gency relations with other environmental 
events. By “behavioral effect” I mean 
any change, transient or enduring, ob- 
served or inferred, in the behavioral at- 
tributes of an organism which results 
from reception of the stimulus by that 
organism. The simplest and best known 
behavioral effect of a stimulus is, of 
course, response evocation, but more 
complex consequences of stimuli for be- 
havior are also known. One such effect 
is (instrumental) “reinforcement,” which 
confers upon a stimulus the ability to 
produce instrumental conditioning. Still 
another is drive arousal or reduction, 
which differs from simple response evo- 
cation in that the former is character- 
ized by a change in activity level which 
may be expressed in a variety of behav- 
jors. There is evidence that these ef- 
fects may be acquired and lost in man- 
ners vary similar to the acquisition and 
loss of response evocation. 

Since environmental events are pre- 
sented to the organism in terms of 
stimuli,’ we may formulate the condi- 
tioning schema as follows: A phenome- 
non is known as “conditioning” when it 
may be described as the acquisition by a 
stimulus, Se, of a behavioral effect, Ec; 
through participation of Se in a con- 
tingency relation, T(S., Sy), with an 

unconditioned” stimulus, Sy, whose be- 
havioral effects are Ey. (Although the 
origins of the term “unconditioned stimu- 
lus” lie in the innate autonomic reflexes 
of Pavlovian conditioning, it is termino- 
logically convenient to regard the stimu- 
lus through which conditioning is ac- 
complished as an unconditioned stimu- 
lus, even though it may have received 
its own behavioral effects through 4 


? Since behavioral manipulations of the Or- 
Sanism consist of environmental modifications, 
a ‘i iar oeadent variables of behavioristics, 1n- 
mae. drive operations, can always be for- 
tho ated in terms of stimulus situations, al- 
ee certain methodological complexities 

e in the case of the deprivations. 
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previous conditioning procedure.*) Let 
“E is a behavioral effect of stimulus 
S” be abbreviated “S— E.” Then if 
S.—E, and T(Se, Su) together caus- 
ally imply Se > Eo T(Sc, Su) is known 
as a “conditioning operation,” while if 
S.— Ee is due to a conditioning opera- 
tion, Se and E, are a “conditioned stimu- 
lus” and a “conditioned effect,” respec- 
tively. The role of Z, in the condi- 
tioning procedure seems to be that of 
determining the nature of the condi- 
tioned effect, E., since the conditioning 
powers of a given S, are altered as its 
E, is altered. 

It must clearly be appreciated that 
the generalized conditioning paradigm, 
so formulated, in no way asserts a law; 
it merely describes those generic em- 
pirical features which are exemplified 
by conditioning phenomena. So far, 
nothing has been said about the kinds 
of behavioral effects involved, nor have 
we specified any details of the condi- 
tioning operation, T (Se, Su). There will 
be as many species of conditioning as 
exist behavioral effects and contingency 
relations with the appropriate conse- 
quences. Neither, as yet, has the rela- 
tion between the unconditioned effect, 
E,, and the conditioned effect, Es, been 
specified; we shall turn to a detailed 
analysis of this following a brief sum- 
mary of the two major known types of 
conditioning operations. 

Classical, Pavlovian, or type-S condi- 
tioning. In this instance, the occurrence 
of the unconditioned stimulus, Sw is 
made contingent only upon the occur- 
rence of the conditioned stimulus, Se- 
So far as is known, classical condition- 
ing can be accomplished whether the be- 
havioral effect of S, is innate or itself 
conditioned, though for technical rea- 
sons, higher order classical conditioning 
is difficult to demonstrate empirically 

4It would be preferable to call Su the “con- 


ditioning” stimulus, but this makes for exces- 
sive confusion with “conditioned” stimulus. 
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(35, p. 244 f.), and is, perhaps, but 
weakly established (28). With the be- 
havioral effects of S, held constant, the 
conditioned effect, Æe, appears to be 
similar to the unconditioned effect, Eu, 
when E, is response evocation, positive 
or negative reinforcement, or drive evo- 
cation (e.g, 7, 9, 13, 19, 24, 35). 
Available evidence (e.g., 6, 22) tends to 
weigh against the conditioned acquisi- 
tion of drive reduction. 

Instrumental, or type-R conditioning. 
When the occurrence or cessation of a 
stimulus, Sw, is made contingent upon 
emission of a response, R, in the pres- 
ence of a stimulus, S,—i.e., the occur- 
rence or cessation of S, is dependent on 
the joint occurrence of S, and R—the 
subsequent likelihood of R as a re- 
sponse in the presence of S, may be al- 
tered, depending upon the reinforcement 
value of Su (e.g., 13, 19, 35). When the 
onset of S, contingent upon R in the 
presence of S, results in the increment 
of R as a response tendency to S,, then 
S, is known as a “positive” reinforcer. 
When it is the termination of S„ that 
strengthens S,— R, then S, is termed a 
“negative” reinforcer. The onset of a 
negative reinforcer contingent upon R 
in the presence of Sẹ is known to de- 
crease the strength of S,—> R, perhaps 
through instrumental strengthening of 
competing responses (e.g., 11). During 
instrumental conditioning, the reinforce- 
ment value and probably other behavioral 
effects (other than drive reduction) of 
the reinforcing stimulus also apparently 
tend to become conditioned to S,, un- 
doubtedly through a type-S process. 
Thus R is only a portion of the total 
effects, Ee, acquired by S, during instru- 
mental conditioning. (Note, inciden- 
tally, that classical conditioning may be 
conceived as a special, or degenerate, 
case of instrumental conditioning in 

which the instrumental response, R, is 
the null-response—i.e., Su is contingent 
upon S, plus no particular response.) 
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There is still much that is unknown 
about the parameters of instrumental 
conditioning, particularly in regard to 
the participation of drive. 


THE PROBLEM OF CONDITIONED 
GENERALIZATION 


In predicting the behavioral effects of 
a conditioned stimulus, it is insufficient 
to know merely the stimulus contin- 
gencies in which S, has participated; 
we also need to know the behavioral 
effects of S,. That is, E, is a function 
not merely of T (Se, Su), but also of Ey. 
But what do we know of this function 
empirically? 

The extent of our knowledge here has 
already been summarized (perhaps over- 
tersely) above. We know that so long 
as the behavioral effects of Sy remain 
constant, roughly E, = E, for classical 
conditioning,® and roughly E, = R + E, 
for instrumental conditioning, where R 
is the instrumental response (or, in cer- 
tain instances, its inhibition). But this 
leaves unspecified which behavioral ef- 
fects of Su determine E,, for, in prin- 
ciple, the effects of S, can vary with 
time. At a given moment, are the con- 
ditioned effects of S, determined by the 
effects of Su at that same moment, or by 
the effects of S, at the time of condition- 
ing? (There are, of course, still other 
possibilities.) Either alternative has as 


5 Cognitive theorists have shown an inordi- 
nate concern over the apparent fact that the 
classically conditioned response tends to differ 
somewhat from the unconditioned response. 
There is no more a priori reason why Ec 
should be identical with Ey than some other 
fixed function of Eu at the time of condition- 
ing—neither is any less “mechanical” than the 
other. The difference between E, and Eu lends 
no more special credence to expectancy theo- 
ries than does the even greater lack of simi- 
larity between Es and Ey during instrumental 
conditioning. What is important for a media- 
tion theory is whether the behavioral effect of 
Se, whatever this may be when Eu is held con- 
stant, shows any tendency to covary with Eu 
after conditioning. 
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much a priori likelihood as the other, 
and only data from experiments in which 
the effects of S, are altered subsequent 
to conditioning will permit decision be- 
twen them. 

The analysis so far has been con- 
ducted in a quasi-formalistic manner in 
an effort to exhibit the problem of the 
functional dependence of conditioned ef- 
fect upon unconditioned effect as, logi- 
cally, an intrinsic aspect of conditioning 
phenomena. Let me now put it straight- 
forwardly. When a stimulus, So, is con 
ditioned to a behavioral effect, E 
through contingencies of Se with an un- 
conditioned stimulus, Sy, whose behav- 
ae effect is Ey, what is the functional 

ependence, if any, of E. upon Ey after 
conditioning? For greater clarity, we 
may formulate the problem as an ex- 
perimental paradigm. 

Phase 1. Condition a behavioral ef- 
fect, Ee, to a neutral stimulus, Se, by 
making stimulus $,, with behavioral ef- 
fect E,, contingent upon Se alone, or 
upon a response, R, in the presence of 

o: 

Phase 2. Extinguish E, as an effect 
oh S, and condition to S, a new effect, 
Ey. Extinction of E, is not essential, 
since the objective here is only to modify 
the effects of Sy. However, extinction of 
Eu, when this is possible, tends to maxi- 
mize the difference in the effects of Su 
before and after modification, and hence 
to maximize evaluation of the results in 
Phase 3. 

Phase 3. Test Se for its influences 0” 
behavior. 

There are three major alternative Te 
sults which might be anticipated as the 
me in Phase 3. These are not 
wee exhaustive, but would seem to 
; ver the intuitive expectations of the 

ehaviorist. 

A. Stimulus S, may continue to have 

e as its behavioral effect, and have 
oa no new effect. ‘This indicates 

at under the parameters of the ©- 


periment, the behavioral effects of a CS 
are independent of the subsequent effects 
of the US. Such a result is commensu- 
rate with a nonmediational theory of 
learning. 

B. S, may have lost E, as an effect, 
but instead, have acquired effect Ec, 
where E’, is that behavioral effect which 
S, would have acquired during the initial 
conditioning had E'w rather than E,, 
been the effect of S, at that time. This 
would imply that, under the parameters 
of the experiment, the behavioral effects 
of a conditioned stimulus continue to 
be determined by those of the stimulus 
through which conditioning was achieved, 
so that, empirically speaking, the con- 
ditioning operation has established a 
stimulus-stimulus relationship in that the 
organism’s responses to So take account, 
so to speak, of the current behavioral 
significance of Sy. Theoretically, this 
would imply that the relation between 
a CS and its effect is mediated by a 
process whose behavioral properties are 
under control of the US. 

C. Se may retain effect Ee perhaps 
more or less attenuated, and also ac- 
quire some degree of E'e thus implying 
that the effects of So are only partly de- 
pendent upon those of S, under these 
parameters. The correlated theoretical 
interpretation is that the behavioral ef- 
fects of the CS are multidetermined, 
some involving a mediation process con- 
trolled by the US, and others being in- 
dependent of it. 

For convenient reference, it is desir- 
able to find a distinctive title for the 
experimental paradigm just described. 
Since Outcomes B and C display a de- 
pendence of the behavioral effects of one 
stimulus upon those of another as the 
result of a conditioning procedure, the 
name “conditioned effect dependency” 
provides a literal description but is un- 
pleasantly cumbersome. However, when 
the effects Æ, and 2’. of the conditioned 
stimulus are similar, respectively, to 


28 


those, #, and E’,, of the unconditioned 
stimulus, the much simpler term, ‘“con- 
ditioned generalization,” is also appro- 
priate. For the sake of descriptive ex- 
pediency, therefore, let me stretch a 
point and refer to the generic design as 
the conditioned generalization paradigm, 
even though the effects of the CS and 
US need not literally be the same. 
There are several important observa- 
tions to be made here. First, let me em- 
phasize once again that although the 
conditioned generalization paradigm is 
distinctive as an experimental procedure, 
the empirical question which it is de- 
signed to answer is not a problem sepa- 
rate from or secondary to the facts of 
conditioning. Quite the contrary, it is 
a question about the facts of condition- 
ing. As was illustrated earlier for the 
conditioned reflex, the empirical laws of 
conditioning are traditionally formulated 
in a way that negatively prejudges the 
existence of conditioned generalization, 
with the result that any experiment 
which appears to exemplify this effect is 
regarded as a special and complex in- 
stance to be explained away, if possible, 
in terms of more “fundamental” empiri- 
cal principles. But from the available 
data, there is no more reason to think 
that the primary empirical consequence 
of the conditioning operation is estab- 
lishment of a rigid Se —> E, connection 
than it is the formation of a generaliza- 
tion from S, to S,. We simply do not 
yet know what the facts are. 
Secondly, it is important to realize 
that the question of conditioned gener- 
alization is not a unitary problem to be 
answered by a single experiment, but a 
correlate to every instance of condition- 
ing. There is no particular reason to 
think that the degree of conditioned gen- 
eralization must be the same for every 
type of conditioning. In fact, judging 
from what data there are, it seems prob- 
able that the degree of conditioned gen- 
eralization is not constant even for a 
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given type of conditioning, but is highly 
dependent upon the particular parame- 
ters of the learning situation. 

The next point is an extension of the 
first two. Since every type of condition- 
ing is a conditioned generalization phe- 
nomenon (if only in a trivial sense that 
E, is a constant function of E,), the 
problem of conditioned generalization is 
quite literally a new dimension in the 
empirical study of learning, rather than 
merely an unexplored sector of an old 
framework. As such, the empirical prin- 
ciples which will emerge from the experi- 
mental study of this problem are bound 
to have powerful repercussions for our 
understanding of behavior. Not merely 
are the empirical facts of readily fore- 
seeable applied value; data concerning 
the dependence of conditioned generali- 
zation upon parameters yet to be dis- 
covered will also provide a new set of 
observation variables intimately corre- 
lated with intra-organismic processes for 
which at present we have only tenuous 
theoretical speculations. It is perhaps 
not unfair to claim that conditioned gen- 
eralization is one of the most important 
problems now confronting behavioristics. 

Finally, it should be apparent that it 
is the facts of conditioned generalization, 
whatever they may be, which solve the 
old enigma, “What is learned?” ® Far 
from being a theoretical blind alley, this 
is a straightforward question about the 


empirical consequences of the condition- 
ing operation. 


®Some writers (eg, 21) have interpreted 
the question, “What is learned?”, as concerning 
whether the conditioned Tesponse is a motor 
discharge or an achievement, However, this 
would seem to be more a generic problem in 
regard to the nature of the response than 
about learning as such, Since, as MacCorquo- 
dale and Meehl (21, p. 219) point out, S-R 
theorists are no more committed to, and in the 
main have no more subscribed to, a motor 
discharge conceptualization of the response 
than have cognitivists; the dispute over what 


is learned must lie in something other than 
this. 
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THE EVIDENTIAL Basis FoR CONDI- 
TIONED GENERALIZATION 


I have contended that it is belief in 
a principle of conditioned generalization 
that has motivated development of me- 
diation devices such as expectancies and 
fractional anticipatory goal responses. 
Since rigorous demonstration of the de- 
pendency of theoretical mediation ele- 
ments upon this kind of empirical co- 
variation calls for a technical and fairly 
extensive logical analysis, I shall in- 
stead substantiate this claim by show- 
ing that a large proportion of those ex- 
periments traditionally cited as critical 
for mediation theories are instances of 
the conditioned generalization paradigm. 
These may be grouped into three major 
categories: (a) “secondary generaliza- 
tion,” (b) “preconditioning,” and (c) 
a more loosely organized set of experi- 
ments which may be called “secondary 
instrumental conditioning.” 

Secondary generalization. In 1933, 
Shipley (32) reported that if, for hu- 
man subjects, an eyeblink is conditioned 
to a flash of light with a tap on the 
Cheek as the US, and a finger flexion 
is subsequently conditioned to the tap 
(shock as the US), the light flash will 
be found to evoke finger flexion. On the 
basis of this single experiment, Hull (16, 
17) proclaimed the principle of “second- 
ary generalization,” which hypothesizes 
that if two stimuli, Sı and S2, evoke the 
Same response, R, then another response, 
R’, conditioned to one of the stimuli, 
say Sı, will also be evoked by the other, 
S2. This formulation differs somewhat 
from that of conditioned generalization 
in that, for secondary generalization, S1 
and Sz have not necessarily had to par- 
ticipate in a common contingency rela- 
tion. However, the case of secondary 
generalization, wherein the stimulus S1» 
i which R’ is subsequently conditioned, 

as been used previously as the US for 
Conditioning R to Sə, as is true in the 
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Shipley experiment,’ is also, obviously, 
an instance of the conditioned generali- 
zation paradigm. 

Shipley’s experiment and results have 
been successfully repeated by Lumsdaine 
(13, p. 230), and a similar design by 
Graham (12), with dogs, also showed 
positive results. Thus the evidence is 
rather convincing that at least some 
degree of conditioned generalization is 
sometimes established during classical 
conditioning, although Razran (28, p. 
328) cites certain Russian studies that 
cast doubt on its ubiquity. These ex- 
periments do not permit evaluation of 
the strength of the effect, however; in 
particular, they do not permit decision 
between alternatives B and C of the 
conditioned generalization paradigm, nor 
whether conditioned generalization is the 
rule, rather than the exception, for clas- 
sical conditioning. It should be added 
that the Shipley-Lumsdaine-Graham ex- 
periments appear to be the only avail- 
able evidence directly relevant to the 
secondary generalization hypothesis, de- 
spite the important role this supposedly 
empirical principle has played in medi- 
ated S-R theory. The notion of sec- 


7 Hull (16; 17, p. 192 f.) discusses Shipley’s 
1933 data as though Shipley had not condi- 
tioned finger flexion to tap on cheek, and as 
though transfer of finger flexion to light flash 
were due to shock evoking both eyeblink and 
finger flexion. Such an interpretation would 
be highly favorable to the existence of second- 
ary generalization and the response theory of 
mediation, but it is definitely not in accord 
with the facts of the experiment. 

8 A later (1935) experiment by Shipley (33), 
in which Sı and Ss were separately conditioned 
to R, appears to support the existence of sec- 
ondary generalization as distinct from condi- 
tioned generalization. However, close analysis 
of the control data in this and the 1933 ex- 
periment reveals the 1935 results to be am- 
biguous. The preconditioning experiment by 
Wickens and Briggs (38), mentioned later, 
might be interpreted as an instance of second- 
ary generalization, but only if we are willing 
to regard human behavior in which a stimulus 
elicits a response as the result of verbal in- 


30 


ondary generalization has been occa- 
sionally invoked in other experimental 
contexts, such as generalization gradients 
among symbolically related stimuli, but 
more as an explanatory device than an 
object of experimental demonstration. 
Thus there are no grounds for believing 
in a principle of secondary generaliza- 
tion as distinct from conditioned gener- 
alization. 

Preconditioning. The preconditioning 
paradigm begins with repeated pairings 
of two supposedly neutral stimuli, S, 
and Sz, and subsequently conditions a 
response, R, to one of them, say Sj. 
Preconditioning is shown if S now also 
tends to evoke R. The preconditioning 
design is a special case of the condi- 
tioned generalization paradigm in that 
the behavioral effect of S, during the 
pairings of Sı and Sə is the null-effect, 
or, more likely, an unidentified effect. 
This phenomenon was demonstrated 
fairly conclusively by Brogden in 1939 
(2). Since then, a number of precon- 
ditioning experiments have appeared in 
the literature (1, 3, 4, 8, 18, 34, 38), all 
but one (3) reporting positive findings. 
But, the claims of Bitterman (1) not- 
withstanding, any attempt to evaluate 
the magnitude of the effect strongly sug- 
gests that the phenomenon is a minor 
one, at least judging from the published 
studies. Thus the empirical importance 
of preconditioning is still problematic. 

Secondary instrumental conditioning. 
In this category, we may include those 
experiments which, following establish- 
ment of an instrumental response, 
modify the behavioral effects of the re- 


inforcing stimulus. More specifically, 
the problem may be phrased: If an in- 
strumental response is conditioned to a 
stimulus, Se, through reinforcement by 
a stimulus, Sw, and the reinforcement 


structions as an instance of “response evoca- 
tion” in the same sense that we apply this 
phrase to infra-human behavior. 
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value of Su is now altered, will the re- 
sponse to S, remain unaffected, or will 
it have adapted in a manner commensu- 
rate with the new behavioral significance 
of Sa? When Su is positively reinforc- 
ing during conditioning, and is subse- 
quently extinguished of positive value 
or aversively reconditioned, this design 
may also be called “secondary extinc- 
tion” (29), “latent extinction” (31), or 
“nonresponse-extinction” (10). A typi- 
cal secondary extinction experiment con- 
ditions, say, a position habit in a T maze, 
one goal box of which contains food or 
water. The animal is then placed for a 
while directly in the positive goal box, 
which is now empty of the primary re- 
inforcer, and where the animal may also 
be shocked. The experimental question 
is whether the position habit will appear 
weakened on the next trial. The habit 
may be analyzed as a response chain 
held together by secondary reinforce- 
ment deriving from the sequence of 
discriminative cues. Reconditioning di- 
rectly in the goal box tends to extin- 
guish or aversively recondition the ter- 
minal reinforcers of the sequence, and 
the next maze trial reveals the immedi- 
ate effect on earlier links of the response 
chain. 

An experiment of this sort was first 
proposed by Tolman in 1933 (37), who 
obtained negative results. But shortly 
thereafter, Miller (23) reported that 
shocking rats placed directly in a dis- 
tinctive goal box inhibited a running re- 
sponse which had previously terminated 
in that goal box. Secondary extinction 
research subsequently languished until 
the appearance of a number of recent 
studies (5, 10, 26, 29, 30, 31), most 
of which, unfortunately, have technical 
flaws (cf. 26, 29) which vitiate their 
interpretive significance, However, the 
evidence seems fairly definite that, un- 


der at least some parameters of con- 


ditioning, a lever-pressing habit in rats 
is quite independent of the subsequent 
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value of the reinforcers through which 
the habit was established (29). 

An important variety of traditional 
latent learning research (type 5 [21, p. 
211]) may also be reduced to the sec- 
ondary-instrumental-conditioning rubric. 
In this instance, the organism is given 
experience with a maze, with eliminable 
reinforcers either absent, or equally dis- 
tributed to all end boxes. The animal 
is now introduced directly into one end 
box and given a previously unencoun- 
tered reinforcement, either positive or 
negative. The problem is whether or 
not the animal’s maze behavior will have 
adapted to the changed conditions on 
the next trial. This differs from more 
obvious cases of secondary instrumental 
conditioning only in that the role in es- 
tablishment of initial maze habits played 
by the stimuli whose secondary rein- 
forcement values are subsequently modi- 
fied is not so clear. When no known 
reinforcers are obviously influential in 
the initial maze behavior, this design 
may be described as “instrumental pre- 
Conditioning,” since it involves a stimu- 
lus, Są (say, an end box), as contingent 
on a response, R, in the presence of an- 
other stimulus, S, (the antecedent por- 
tion of the maze), but where S,, has no 
readily identifiable reinforcement value 
Prior to the reconditioning operation. 
A number of experiments of this sort 
have been reported (see 21, p. 211, for 
Summary; also, 14, 25, 36), with out- 
Comes about equally divided between 
Positive and negative. Thus again, the 
evidence appears to favor the existence 
Of conditioned generalization under some 
circumstances, but more explicit evalua- 
an of the empirical significance of these 
teeriments is precluded by the difficul- 
man in giving an unambiguous account 
2 the initial maze learning in terms of 

Onditioning principles. 
ant summary of the available evidence 
Ncerning conditioned generalization, it 

Ould thus appear that while the quan- 


tity of relevant data is very meager, 
they distinctly imply that parameters of 
conditioning exist under which at least 
some degree of correlation between the 
behavioral effects of CS and US is es- 
tablished. Unfortunately, the existent 
experiments happen, for the most part, 
to be of special designs which either 
employ a minimal or unknown uncon- 
ditioned effect, Eu, or do not compare 
the relative strengths of E. and E’, sub- 
sequent to reconditioning of Su. It is 
only when E, and E’, are strong and 
well defined, and the alternatives E, 
and E's are distinct and quantifiable, 
that we can decide whether the primary 
result of a conditioning operation is es- 
tablishment of a full-fledged generaliza- 
tion from US to CS, or whether the 
prevalent notion that conditioning at- 
taches a specific effect to the CS is 
basically correct, and apparent cases of 
conditioned generalization are secondary 
phenomena to be explained as special 
instances of more fundamental empiri- 
cal principles. It is earnestly to be 
hoped that the present analysis, by 
stripping from the empirical question of 
“What is learned?” its obscuring theo- 
retical superstructure, and developing a 
general experimental paradigm which 
isolates the essential features of the 
problem, will stimulate research in- 
tended not so much to test a theoreti- 
cal prediction as simply to learn the 


facts. 
SUMMARY 


I have contended that, far from being 
a blind alley, the question “What is 
learned?” is an extremely important em- 
pirical problem about the consequences 
of conditioning, and one which has been 
concealed by improper formulations of 
the known facts. We have traditionally 
assumed that a conditioning operation 
attaches a fixed behavioral effect, Es 
to the conditioned stimulus, Se, whereas 
in fact, the conclusion that S, will give 
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rise to E, subsequent to conditioning és 
empirically justified only so long as the 
behavioral effects of the unconditioned 
stimulus, Su, remain constant. The ques- 
tion then arises as to the extent to 
which, subsequent to the conditioning 
operation, the effects of S, are a func- 
tion of those of S,. I have elected to 
identify this as the problem of “condi- 
tioned generalization,” since, in one of 
its forms, it asks whether or not a con- 
ditioning operation establishes a gener- 
alization from US to CS. 

Since the empirical consequences of 
the conditioning operation constitute a 
problem extending over the entire range 
of conditioning phenomena, and one 
which has remained unexplored in the 
overwhelming majority of conditioning 
experiments, conditioned generalization 
defines a new dimension of behavioral 
research. What few data are available 
suggest that conditioned generalization 
does exist as a phenomenon, but are to- 
tally inadequate as a basis for judging 
whether it is a basic behavioral principle 
or merely a secondary effect. Because 
of the practical value of this informa- 
tion, and its crucial significance for the 
development of behavior theory, it is to 
be hoped that the conditioned generali- 
zation problem will soon be subjected to 
a concerted experimental attack. 
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PERCEPTION: EVOLUTION OF A CONCEPT? 
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Emory University 


This paper is essentially an examina- 
tion of theories of perception—espe- 
cially in their relation to general be- 
havior theory. If it at times appears to 
be little more than verbal masonry, this 
is because defining such a relationship 
is, in many respects, at present a se- 
mantic problem resolvable in terms of 
personal preference. 

Reference to general behavior theory 
typically implies formulations like Hull’s 
behavior principles applied to maze 
learning, the learning of nonsense syl- 
lables, brightness and form discrimi- 
nation, social learning, emotional dis- 
turbance, and certain perceptual phe- 
nomena, or the Gestalt principles of 
perceptual organization applied as an 
analogue to explain problem solving, 
and even the organization of overt re- 
sponses. These contrast sharply with 
theories like Granit’s dominator-modu- 
lator theory, and the Wever-Bray volley- 
principle which were devised to account 
for sense-organ function. 

The relation of perceptual to general 
behavior theory. The nexus between 
the two classes of theory under con- 
sideration is typically delineated on a 
background of metaphysical bias, though 
this is often confused by semantic issues. 
Some theorists have contended, as did 
the early behaviorists, that behavior 
is objective (publicly observable) and 
physical (involves muscles and glands), 


1Parts of the present paper were presented 
to the Graduate Psychology Colloquium of 
Purdue University and as part of a sym- 
posium on behavior theory at the second an- 
nual meeting of the Southeastern Psychologi- 
cal Association in the spring of 1956. I am 
indebted to Harry Helson and W. F. Dukes 
for helpful critical evaluations of the manu- 
script. 
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while perception is subjective (acces- 
sible only to the pereceiver) and men- 
tal (involves conscious experience) ; and, 
since scientific theories require objective 
data, perceptual theories ipso facto are 
not scientific. Thus there is no inter- 
theory relationship to describe. This 
Position is predicated on the assumed 
synonymity of “objective?” and “pub- 
licly observable.” An alternative posi- 
tion is to include as objective inferred 
events that stand in a logically consist- 
ent relationship with the publicly ob- 
servable. This means that the objec- 
tivity of a concept is determined by the 
efficiency with which it yields reliable 
replications of observations (9). Per- 
ception thus qualifies as an aspect of 
behavior, though always an inferred as- 
pect. Indeed, if one extends this line 
of reasoning, one must conclude that no 
instance of behavior is directly observed. 
Rather, it is always inferred by an 
intelligent observer. It is meaningful 
(i.e., evident as behavior) to the extent 
that the data of observation are similar 
in some fashion to data which the ob- 
server has previously used in construct- 
ing his conception of the world (1). It 
is, of course, not incompatible with this 
view to argue that gross bodily move- 
ments, since they do not require trans- 
formation by the subject into verbal 
form before becoming available data, 
are more objective than perceptual data. 

Having committed oneself to this lat- 
ter view—as I have—one is still faced 
with the necessity of defining the rela- 
tionship between perception and other 
aspects of behavior. The early psycho- 
physicists viewed perception as a simple 
mediating process—a constant in the 
transformation of information received 
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by the organism from a form that exists 
external to the organism to a form that 
is utilizable in overt response. Thus 
considered, there is no problem of per- 
ception beyond that of translating into 
Psychological terms—or, as some prefer, 
into physiological terms—the character- 
istics of the external world. Once the 
Constants of transformation have been 
determined (cps expressed as mels, db 
as sones, millilamberts as brils, grams 
as vegs, texture gradient as slant, etc.), 
the perceptionist’s task is complete. In- 
deed, since perception does not dynami- 
cally affect other aspects of behavior, 
ea may ignore perception and corre- 
ate stimulus input with some aspect of 
Overt response. 

Recent observers insist, however, that 
Perception is a function not only of 
te stimulus input and receptor 
Ee but also of motivation (which 
Te stimulus properties) and past stimu- 
Pea Helson (34) calls residual 
on i. Thus the perceptual mecha- 
fhe 4 a complex interactive one, and 
 apepeg and definition of the 
ats lons that influence it become 1n- 
a Saag to an understanding and 
a iction of both perception and overt 

ponse, 

a understanding perception is im- 
as to an understanding of overt 
Ser is an insight shared by mod- 
ie renomenological and behavior theo- 
ticles Approximately 25% of the ar- 
over the the PsycHoLocicaL REVIEW 
erce a past five years have been on 
‘a Ption. Floyd Allport (4), known 
Aea as an uncompromising be- 
Mer: of the unsentimental type, has 
iwin in the preface of his volume re- 
if thee Perceptual theories: “In them, 
Enna taken all together, lie the 
e a of our present knowledge of 
livin the most important activities 0 
Wo 8 beings.” In similar vein are 
tio OtWorth’s asserti “the condi- 
ioning wen ssertion that “the ¢ 
Xperiment is really concerne! 


with the establishment of a new per- 
ception” (73, p. 124), and Hilgard’s 
statement that “the learning theorist 
must be aware of the learning within 
perception as well as the role of per- 
ception in learning” (39, p. 332). One 
may even suspect that Skinner (60) has 
made a concession of sorts when he 
writes of the “interpreted stimulus.” 
Aim of the present paper. Having 
assumed a metaphysical bias concerning 
the status of perception in psychology, 
I might follow any of several paths of 
discourse for the remainder of this pa- 
per. I have simply chosen to review 
critically some ideas about the nature 
of perception that have been prominent 
during the last fifty years, at the same 
time recognizing that thumbnail sketches 
of theoretical orientations involve grave 
risk of distortion and oversimplification. 


AN OVERVIEW OF CONCEPTUAL 
DEVELOPMENT 


I have titled these remarks “Percep- 
tion: Evolution of a Concept.” It may 
be countered that while change in con- 
ceptualization is evident, this does not 
constitute evolution, and that while 
there is increased complexity in the 
concept’s definition, this is not the uni- 
directional unfolding which is charac- 
teristic of evolution—it would even ap- 
pear that some of our notions have 
cycled full circle. Indeed, close exami- 
nation of the recent history of thinking 
about perception prompts the suggestion 
that these changes might more properly 
be described as spiraling than as cir- 
For example, early definitions 
t only in the broad com- 
mon-sense fashion that the metaphysics 
of Locke et al. was empiricist. In the 
early behaviorist’s attempts to apply the 
systematization of science to psychology, 
the problem of perception was solved by 
eliminating the concept. Later system- 
izations (e.g., Hebb’s) have been in- 
creasingly rigorous, and at the same 


cling. 
were empiricis 
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time have dealt with phenomena cov- 
ered by the concept. Indeed, in Hebb’s 
system, perceptual development has cen- 
tral status. . 

When physiologizing in psychology 
was in vogue 35 years ago, perception 
was rejected as a mental phenomenon. 
Today, physiologizing is again in vogue, 
but physiological systems attempt to ac- 
commodate the behavioral facts of per- 
ception. Finally, early in the period we 
are considering, the role of experience 
in perception was broadly defined and 

accepted as a self-evident metaphysical 
principle. Later, it was repudiated by 
the demonstrations of the Gestalt move- 
ment. More recently, the importance of 
past experience has been reaffirmed and 
the nature of the relationship more pre- 
cisely defined, this time through me- 
thodical experimentation. 
Conceptual evolution can be seen, I 
think, in numerous ways. First, there 
has been an increasing sophistication of 
perceptual theory qua theory. Fifty 
years ago, perception was essentially 
defined by a single a priori proposition. 
Today, in any of the several theoretical 
orientations extant there are attempts 
to represent it by a logically related set 
of propositions, connected through ex- 
plicitly specified operations to the level 
of empirical generalization. Inspect, for 
example, the later hypothesis theory of 
Bruner (13), and Postman (56). Ad- 
mittedly, perceptual theory has not 
achieved the status of a physical theory 
like the dynamical theory of gases. Yet 
the recent successes of adaptation level 
in quantitative prediction make for ex- 
treme optimism (34). 

Secondly, the purpose of perceptual in- 
vestigation has come to include greater 
concern for the elaboration of theory 
through empirical study. Earlier, per- 
ception was a term used in the descrip- 
tion of the content of consciousness. It 
represented the compounding of sensa- 
tions and images of past experience. 


Later the study of perception meant a 
catalogue of demonstrations and princi- 
ples designed to explain, as Koffka put 
it in his Principles (46), why things 
appear as they do. Still more recently, 
the aim has been to construct a class of 
empirically based propositions that may 
be fruitfully used—usually along with 
other classes of proposition—in the pre- 
diction of a wide repertoire of behavioral 
activities. 

Thirdly, there perhaps has been a 
more clearly conscious appreciation of 
the role and status of models and a 
more discriminating use of them. One 
need contrast only the psychological 
atomism of Titchener (62) with the 
current physiological atomism of Hebb 
(32) or the application of physical field 
theory by the Gestaltists in the 1920’s 
with its present use in Kéhler’s work on 
visual figure currents (49, 50). 

Perceptual models have, for the most 
part, been material models. Even in 
psychophysics and in adaptation-level 
theory where the explicit model is for- 
mal, one can find, I believe, an implicit 
material model. Early in our fifty-year 
period, models were essentially static- 
descriptive. In modern psychophysics 
(31), in Gibson’s psychophysics of form 
perception (26), and in adaptation- 
level theory (34) the models are of 
these types. (It is not intended to im- 
ply that these are of a lower order than 
dynamic models, for, as Helson has 
pointed out, the trick in science is to 
express dynamisms mathematically and 
therefore statically.) But as perceptual 
theories became more functional—i.e., 
as interest is turned toward the relation 
between perception and other aspects 
of behavior—models became more fre- 
quently dynamic models: e.g., those of 
Woodworth (73), Tolman (64), Bruns- 
wik (17, 19), Ames and Cantril (e.g: 
21), Bruner and Postman (e.g., 13), 
and Vernon (65). Finally, the early 
models were molecular. As the holistic 
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properties of perception were empha- 
sized, and later the status of perception 
as behavior stressed, the use of molar 
models become prevalent (contrast the 
psychological atomism of Titchener [62] 
and the functionalistic theory of Bruns- 
wik [17]). As interest returns toward 
physiologizing and toward dimensional 
analysis in some areas of investigation 
and theory construction, molecular mod- 
els are again coming into use—e.g., the 
cell-assembly theory of Hebb (32). 
Fourth, the concept of perception has 
changed in complexity, and definition 
has shifted from essence to process. 
Early conceptualization centered around 
Simple sensory qualities and the mem- 
ory images which are vestiges of them. 
These elements were related in an un- 
Complicated fashion to a rather re- 
Stricted number of physical stimulus 
dimensions. In more recent formula- 
tions, a larger number of variables are 
included, and the nature of the inter- 
variable relationships is regarded as 
More complex. The relational char- 
acter of perception is stressed, inter- 
Modality effects and the role of back- 
ground described, and the contribution 
` nonsituational—i.e., motivational and 
earning—variables is emphasized. The 
View has changed from one of percep- 
tion as a static picture of the external 
world to perception as the resultant of 
a complex repertoire of interacting proc 
esses, 
Minal, one gains the impression of 
os increased variety of media used in 
ane perceptual studies. While the 
Ea NE, of investigations still employ 
au buman subjects, animals and chil- 
n appear to be used more frequently- 


Some PERCEPTUAL THEORIES 
IN OUTLINE 
ha following sketches are arranged 
Crono ur broad categories, roughly 
tee logical. This, however, 1S not to 
lewed as a rigorous classification, 


for many of these views overlap in some 
fashion or other. The reader should ex- 
pect a discussion of the place of percep- 
tion in recent learning theory. How- 
ever, we shall not deal formally with 
this matter, for learning theorists have 
done little more than recognize, albeit 
to different extents, the importance of 
taking cognizance of perceptual vari- 
ables. Skinner (60) dismisses percep- 
tion as conditional seeing, by drawing 
a distinction between discriminative be- 
havior and response to discriminative 
behavior. Berlyne (8) proposes a place 
for perception in Hullian theory as an 
intervening stimulus-producing response 
(attention is its momentary effective re- 
action potential), and Wickens (71) re- 
gards it as dependent upon the differ- 
ential reinforcement of certain magni- 
tudes within the heterogeneous stimulus 
complex confronting the organism. For 
Tolman (64) perception is one of three 
types of intervening-variable systems, 
characterized by categorizing dimensions 
and intensitive dimensions (qualitative 
and quantitative orders). Thus, unlike 
others, he provides a formal distinction 
between perception and other mediating 
processes, but unfortunately he does not 
make clear how these formal distinctions 
relate to the level of empirical observa- 
tion. Meanwhile, no theorist has yet 
reconciled the many facets of percep- 
tual data within his model for learning. 


I. Perception as Conscious Content 


Titchener’s views. Titchener, in his 
1910 textbook (62), described two types 
of perception—pure and mixed. Pure 
perceptions are multidimensional sensa- 
tions, combinations of simple sensations 
under the laws of attention. While the 
end result might appear to be unique 
and unanalyzable (e.g, the flavor of a 
peach), it is nothing more than the sum 
total of sensations evoked by various 
aspects of the stimulus object. Pure 
perceptions are rare, and could occur 
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only in a truly naive perceiver, for they 
involve the grouping of sensations un- 
der the laws of attention. Mixed per- 
ceptions are more common and also 
more complex, for they involve the ad- 
dition of images of past experience. 
These latter originally may be kines- 
thetic images, but as the organism ma- 
tures and his experience increases they 
derive from all modalities. Further- 
more, perceptions differ from sensations 
in more than complexity; they have 
meaning. Meaning in turn derives from 
context, that is, from the setting in 
which sensations occur, It is thus clear 
that meaning depends on the contiguous 
occurrence of more than one sensation. 
Meaning is an inherent property of per- 
ception, for the context that accrues 
from a situation is always the psycho- 
logical equivalent of its logical mean- 
ing. Meaning is carried by images as 
well as sensations and, indeed, these lat- 
ter may degenerate into a kind of sym- 
bolic shorthand, largely unconscious. 
In this view the primary concern is 
with the content of awareness. While 
perception involves what the Gibsons 
(29) have called enrichment, its literal 
aspect (the perception of the material 
properties of things) depends on sen- 
sory processes, and thus is an invariant 
and faithful representation of that part 
of the external world with which it is 
concerned. The aim of psychological 
investigation is the analysis of experi- 
ence through introspection. This, in any 
final analysis, is an exercise in catego- 
rizing, for the method provides no means 
of empirical verification or check of in- 
ternal consistency. The goal of psy- 
chology is met when these analytical de- 
scriptions are complete. The question 
of etiology and development as an em- 
pirical problem never really arises. 
James’ views. William James’ (45) 
conception of perception, like that of 
Titchener, is an enrichment theory. 
Like that of Titchener, it is concerned 


with the content of awareness. Percep- 
tion is the consciousness of particular 
materially present objects. Although 
sensations cannot be identified except 
in a logical sense, perceptions are rep- 
resented as the combination of sensa- 
tions and reproduced attributes. Sensa- 
tions have both intensity and extensity, 
“thingness” depending on the coales- 
cence of different sensations. But what 
we find here that we do not find in 
Titchener is emphasis upon the con- 
structive role of past experience. Every 
perception is acquired, and, indeed, as 
it is found in the adult, represents a 
long course of development of order out 
of chaos. Furthermore, every percep- 
tion is of definite and probable things. 
We perceive the thing—and this point 
is supported by extensive reference to 
illusions—“which would most usually 
have given us that sensation” (45, p. 
316). In fact, habits of perceiving be- 
come so thoroughly established that 
new and alternative perceptions are 
more and more difficult to achieve. 
“Old-fogeyism,” James contends, “is the 
inevitable terminus to which life sweeps 
us on” (45, p. 328). 

There is much in James that we see 
highlighted in later interpretations. The 
long course of development represented 
in perception has been emphasized by 
Hebb (32). James’ notions of probable 
perceptions and old-fogeyism are bril- 
liant anticipations of Brunswik’s prob- 
ability learning (15, 18), Ames and 
Cantril’s assumptions (e.g., 21), and 
Duncker’s functional fixedness (24). 
But he does not take the further step 
that the modern functionalists take by 
regarding perception as resulting from 
an interaction between perceiver and 
perceived. Perceptions are learned, but 
it is not clear that this involves any- 
thing more than the passive compound- 
ing of sensations and memories of sensa- 
tions, providing a kind of catalogue of 
pictures, each available for view on ap- 
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propriate occasion. This half-way view 
is perhaps an inevitable by-product of 
James’ concern with content. Mean- 
while, his discussion of illusions consti- 
tutes an armchair preview of the type 
of demonstration that Ames and his 
followers have popularized in the last 
decade. 

The Gestalt orientation. The Berlin 
Gestaltists were among the earliest to 
recognize the importance of asking why 
objects appear as they do. Their an- 
swer was simple and direct. Taking 
their lead from Ernst Mach’s conten- 
tion that what we perceive are things 
(48), they reacted against the elemen- 
talism of the structuralist, arguing that 
sensations are logical abstractions and 
not psychological phenomena. Thus, if 
perceptions are not constructed from 
sensations, they must be immediately 
given in experience, our nervous system 
organizing them out of what Einstein 
has called the “rabble of the senses.” 
There is assumed to be an isomorphism 
between brain events and experience, 
the structure of perception mirroring 
the structure of cortical organization. 
The Gestaltists’ rejecting of the elemen- 
talism of the structuralists also means a 
rejection of their empiricism. Koffka 
(46) completes the discussion of al- 
Most every major problem of perception 
(ea, figure and ground, phenomenal 
ee with a dialectical refutation 
empiricism. This, however, does not 
ei denying the possible significance 
(6 past experience. Both Wertheimer 
nee ba Kohler (47) recognize it as a 
an. le, but object to its indiscrimi- 
a application as a universal explana- 
logre Spee While the phenomeno- 
Whole description of perceptions 4S 
RS suggests a preoccupation with 
one content, the Gestalt theory 15 
cine = neurophysiological in orienta- 
TEA hile meaning is a complex phe- 
noted on, depending, as Köhler has 

(48, p. 73), on personal biog- 


raphy, certain meaning (that of object 
qua object) is given in the perception, 
and thus by the brain dynamics. In- 
deed, the problem of meaning is prob- 
ably what took the Gestalists away from 
peripherally biased structuralism to their 
theory of cortical organization. Curi- 
ously, their theory is still one of the en- 
richment type. The important transi- 
tion in thought is that perceptions, as 
the psychological counterpart of brain 
dynamics, are constructs, or, more Cor- 
rectly, operational links between physio- 
logical constructs and the level of em- 
pirical data. Gestalt demonstrations 
are the means by which the functional 
properties of central mechanisms are 
known. This is most clearly illustrated 
in Köhler and Wallach’s monograph 
(49) on figural aftereffects. 


II. Perception and Modern Psychophys- 
ics 

Grahams behavioristic psychophysics. 
Modern psychophysicists are, by and 
large, S-R psychologists. Beyond the 
assertion that their basic data are sim- 
ple discriminations, they leave the meta- 
physical issues to someone else. With 
a few brilliant exceptions (e.g., Helson’s 
work on adaptation level), their aim is 
an encyclopedia of quantitative relations 
between certain aspects of stimuli, de- 
fined as energy functions, and responses, 
similarly defined. As Graham points 
out (31), psychophysical functions are 
special cases of the general positivistic 
formula, R= F (a...2), where all 
variables are constant except stimulus 
dimension @. Special pains are taken 
to create situations which yield quanti- 
tative, easily interpreted results. There 
is no expectation that universal concepts 
can be generated. 

Several general criticisms have been 
leveled at the psychophysicists. First, 
it has been contended that the tradition 
of sensory psychology from which they 
spring has blinded them to the impor- 
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tance of central variables in behavior in 
the setting of the psychophysics experi- 
ment. Second, this same background 
tends to result in an overly simple no- 
tion of the basic determinants of per- 
ception. Third, the restricted designs 
demanded by psychophysical methodol- 
ogy preclude identification of interac- 
tions among the various determinants 
of perception, and thus limit the possi- 
bility of fruitful theorizing that might 
stem from the data that complex de- 
signs yield. Lastly, conditions of ob- 
servation are so artificialized in order 
to meet the aims of precise control that 
the data yielded have only limited ap- 
plicability. 

Such criticisms are basically state- 
ments of an opposing bias. Whether 
they are more than this waits on an 
empirical check of their validity. The 
psychophysicist must also, of course, 
answer them at the level of concrete 

‘data. Meanwhile, the kind of defini- 
tion of basic elements that this sort of 
approach gives is a necessary first step 
in the construction of a general theory. 

Helson’s adaptation-level theory. Ex- 
amination of Helson’s work over the last 
25 years shows it to be shaped by in- 
terests in the Gestalt definition of the 
problems of perception and in the psy- 
chophysicist’s aim of quantification. His 
recently introduced concept of adapta- 
tion level reflects particularly this latter 
influence. This he presents as the basis 
for a general theory of perception (34, 
35). In outline, it may be regarded as 
the basis for a general theory of be- 

havior. 

Adaptation-level theory begins with 
the assumption that the perceptual re- 
sponse may be represented as a func- 
tion of the difference between some pres- 
ent stimulus process and an internal 
standard derived from a pooling of the 
effects of previous stimulation. The 
status of this standard, expressed in 
terms of some quantifiable property of 
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the external stimulus situation, depends 
both upon the characteristics of stimuli 
(e.g., frequency of presentation, magni- 
tude, nearness in time, etc.) and of their 
background. It is given expression as 
a mathematical equation. Thus the 
model employed, unlike those employed 
in the majority of current perceptual 
theories, is formal rather than material, 
although it may be suggested that be- 
hind the formal model is the notion of 
the organism as a logarithmic computer. 
Since the model is mathematical, the 
work which it has inspired is necessarily 
static-descriptive in nature. But the 
concept is functional and thus evolu- 
tionary and dynamic in intent. 

Adaptation-level theory has functioned 
as a precise predictive tool in a variety 
of situations to which it has been ap- 
plied in the last 10 years. It has, for 
example, provided for the quantification 
of the shift in the neutral point with 
a change in standard in the classical 
psychophysical weight-lifting situation 
(33, 34). It has successfully reconciled 
phenomena like color constancy, con- 
trast, and conversion. It has made 
possible the quantification of concomit- 
ant changes in hue, brightness, and 
saturation (38). It provides a rational 
basis for the understanding of anchor 
effects, and may be applied fruitfully to 
the study of figural aftereffects (44). 
Recently, Helson has used it to scale 
the value of verbal terms with quantita- 
tive denotations, to demonstrate the ef- 
fect of the context in which the terms 
are used upon these values (37), and to 
investigate the effects of personal and 
group factors upon the expression of 
attitudes (36). 

Helson regards adaptation level to be 
the means for translating the notion 
of perceptual organization into objective 
and quantitative terms. In principle it 
does this. Successful application to the 
more complex situations that character- 
ize everyday life is thus basically an 
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empirical matter of identifying the vari- 
ables and their properties that are im- 
portant for organization—that is, of de- 
lineating in greater detail the three 
broad categories of factors that he 
identifies as stimulus, background, and 
residual. We are at present particu- 
larly short of knowledge of the non- 
stimulus variables. We have yet, for 
example, adequately to establish the 
criterja for differentiating relevant from 
nonrelevant stimulus input. When this 
1s accomplished, A.L. provides a means 
of predicting, within the class of rele- 
Kont variables, what constitutes equiva- 
ent stimuli. Before the advent of A.L. 
theory this could be accomplished only 
a posteriori, in empirical terms (10). 
rele of the parameters important to 
me are not yet unequivocally defined. 
is not always sufficiently clear, for in- 
Stance, what is background and what is 
residual stimulation. 
a, Adaptation level is one of several con- 
Ne currently directed toward the aims 
Of general theory. Its assumption of 
Pooling is also found in Brunswik’s 
Probability learning (17). If it is re- 
ea as a probability function, it is 
ae to; resemble strongly Postman and 
e ers concept of hypothesis (13), 
an itisa quantitative version of Can- 
ie S assumptive world (21). It is 
paved that the theory will be expanded 
rot it will consist not only of the 
a enee mula, but also of an entire set 
Si ated mathematical expressions that 
R give more precise formal definition 
a only to the reference level, but to 
cee amith reference to both in- 
Jaret and time—of relevant stimulus 
al ene complexity of relevant vari- 
TER e rigidity-plasticity of the judg- 
erence Sn the stability of the ref- 
within a the fineness of articulation 
E e judgmental dimensions, 20! 
ie that constitute the pet 
A rame of reference. Just how 
Y equations would constitute the 


formal set would depend, of course, on 
the nature of the phenomena to which 
the theory was applied. Such a set of 
equations would be analogous in func- 
tion to the set that relates work to force 
and distance, effort to distance and re- 
sistance, mechanical advantage to effort 
and resistance, and power to work and 
time in Newtonian mechanics, or to the 
set that relates pressure, volume, and 
temperature in the dynamical theory of 
gases. 

Gibson’s psychophysics of surface and 
edge. Gibson is concerned primarily 
with literal perception (27), that is, the 
perception of objects qua objects. Fur- 
thermore, his objects are the solid ob- 
jects of the three-dimensional visual 
world (26). Awareness of these, he as- 
sumes, is immediately given by virtue 
of a kind of isomorphism between the 
spatial properties of the visual stimuli 
and the retinal events to which they 
give rise. Two-dimensionality is thus a 
learned refinement of three-dimensional 
perception. One sees in Gibson’s the- 
ory, as in Helson’s, evidence of two ma- 
jor influences: the quantitative aim of 
psychophysics and the relational ap- 
proach of Gestalt psychology. He en- 
visages quantitative description where 
the elements are surface and edge rather 
than points of color. For him stimu- 
lation is ordinal, that is, defined by 
gradient relationships within the retinal 
process. 

There is an explicit nativism in Gib- 
son’s approach, for perception is re- 
garded as reflecting directly the proper- 
ties of the retinal correlate. But, unlike 
the Berlin Gestaltists, Gibson stresses 
the correspondence rather than the non- 
correspondence between the external en- 
vironment and our perception of it. 
Learning for him is specificity learning: 
the gradual awareness of greater detail, 
the enhanced capacity for discrimina- 
tion (29). Through learning, percep- 
tions acquire meaning. But perceptions 
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must be structured before meaning can 
be acquired, although some meanings 
may be embryonic in the same way that 
literal perception is. This enrichment 
of perception by meaning, however, is 
not, in the strict sense, perceptual 
learning. 

It is curious that while Gibson’s in- 
sistence on a psychophysics of object 
perception spells objective science, his 
distinction between the visual field and 
the visual world is pure phenomenology. 
His most valuable contributions are the 
notion of ordinal stimulation and his 
geometry of continuous surfaces. He has 
indeed made it clear that the retinal 
correlate is not a picture of the external 
world. Meanwhile, it may be contended 
that he has overestimated the veridi- 
cality of perception by his assumption 
of an identity between perception and 
judgment that is open to question (cf. 
his studies on size constancy [26]). 
Certainly his insistence that perceptual 
learning is specificity learning requires 
a distinction between perception and 
conception that cannot confidently be 
achieved beyond the verbal level. Fi- 
nally, his theory is a logical construc- 
tion that still awaits systematic experi- 
mental evaluation. 

Attneave’s application of the informa- 
tion theory model to form perception. 
The persistent need to quantify percep- 
tions typical of everyday life has, in the 
last several years, served to motivate at- 
tempts to apply the information-theory 
model to these phenomena. Attneave 
has been a leader in this approach (5, 
6). He suggests (5) that if perception 
is thought of as information handling, 
it can be immediately seen that much of 
the information provided is redundant. 

If the visual field were fractionated by 
a grid, redundancy would be found in 
connection with areas of homogeneous 
color, persistent direction or slope of 
contour, symmetry of design, or uni- 
form gradient of texture—indeed, any 


WILIAM BEVAN 


homogeneity of pattern. The Gestalt 
notion of good figure thus refers to some 
high degree of internal redundancy. The 
perceptual mechanism operates to mini- 
mize redundancy, encoding input into 
more economical form. Hence, identi- 
fiable forms may be communicated by 
simply presenting contour points of 
maximal curvature. 

This view, like Gibson’s, assumes an 
isomorphism between the spatial prop- 
erties of the external object and the 
organism’s perception, the faithfulness 
with which the perception matches the 
object depending upon the efficiency 
with which the perceptual mechanism 
scans and encodes the input. The the- 
ory thus implies an intact, completely 
developed scanner, the character of the 
perception reflecting essentially the char- 
acter of the input. As it currently 
stands, the model deals with redundancy 
associated with the geometrical proper- 
ties of the speciously present stimulus. 
There is yet no extension to the prob- 
lems of temporal redundancy and the 
effects of past experience. The learn- 
ing mechanism, when provided, would 
appear to be necessarily of the specific- 
ity type. At the moment there appears 
to be no room for new perceptions of 
the sort considered by the enrichment 
theorists—notably those of the “new 
look” variety. This, then, is a theory 
of literal perception. It is not clear 
how the scanning mechanism will be in- 
structed to respond when the properties 
of perception are not directly inherent 
in the physical object perceived. 


III, Recent Physiological Models of 
Perception 


Hebb’s cell-assembly theory of per- 
ceptual development. Webb has pre- 
sented a neurological theory of percep- 
tion in his Organization of behavior 
(32). While basically of the telephone 
switchboard type, its design provides 
the complexity and flexibility necessary 
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to account for the field properties of 
perception. Its major purpose, however, 
appears to be to provide a vehicle for 
explaining what Hebb regards to be 
the basic role of experience—particu- 
larly early experience—in structuring 
perceptions. Like Berlyne (8), he deals 
With the phenomena of attention in the 
same model. Like James (45), he sees 
even the simpler perceptions in struc- 
ture going through a long course of de- 
velopment. Like Werner (68), he rec- 
ognizes the role of motor processes in 
Perception. His model is an elabora- 
tion of Lorente de Nó’s concept of re- 
Ciprocal circuits, to which has been 
added an assumption of axon growth 
with exercise. (The status of this last 
assumption has been questioned. Sholl 
[59, p. 91] cites research by Haddara 
which reports that while large measur- 
able changes take place in the dendrites 
of cortical neurons of mice after birth, 
no difference could be detected between 
the neuronal development of normal ani- 
mals and those in which the eyes had 
cen enucleated at birth.) Spread of 
ANANA in the developing nervous sys- 
a is at first essentially random. With 
eo receptor stimulation, however, 
+ ope become organized into func- 
a nal units called cell assemblies. With 
ee exercise the stability of these 
aS is enhanced. As experience in- 
ea and is enriched, cell assemblies 
n ine into more elaborate temporal 
ence called phase-sequences. 
oo these notions, Hebb explains the 
nena properties of perception, 
ual Took perceptual learning, percep- 
ie ectivity and perceptual generali- 
enrich, In short, he accounts for both 
3 ings and specificity, as well as 
tual a relationship between percep- 
Hebb’ 4 other psychological functions. 
exercise heory is a brilliant intellectual 
teeni” in ela and insightful. At 
that it.” owever, it is difficult to feel 
Constitutes any more satisfactory 


explanatory device than some of the 
models we have already described. De- 
fining schemata in neurophysiological 
terms does not make them any more 
accessible or, for that matter, real. 
Granted that Hebb has elaborated the 
internal structure of his model in more 
specific detail than other theorists, and 
that he has described in terms of this 
model the nature of the several psycho- 
logical functions, he still has to provide 
a key for transliteration from the terms 
of his model to the terms of empirical 
observation. The chemist has opera- 
tions that enable him both to produce 
and to identify precise changes in mo- 
lecular structure. He can talk, for ex- 
ample, of adding a double bond at a 
certain position in a particular molecule. 
Can the psychologist talk in the same 
way about adding a closed-chain circuit 
to a cell assembly? While it would be 
manifestly unfair to expect this sort of 
specificity so soon from Hebb, his the- 
ory implies it; and unless a means is 
clear by which it ultimately may be at- 
tained, his picture of central mecha- 
nisms, no matter how detailed, is no 
more than metaphorical. 

Werner and Wapner’s sensory-tonic 
field theory of perception. This theory 
is a recent variant of Gestalt psychol- 
ogy. It begins with a need to account 
for what its authors refer to as the pro- 
jective nature of perception as well as 
its active, interactive characteristic. At 
the same time Werner and Wapner re- 
ject the possibility of a simple motor 
theory because they feel that such a 
view could not describe the mechanism 
that accomplishes sensory-motor inter- 
action (67, 68). Instead, they propose 
a sensory-tonic theory. Tonus, as they 
use the term, refers not only to the dis- 
tribution of muscular tension, but also 
to the level and pattern of visceral re- 
activity. Perception is a “total dynamic 
process” patterned not only by the na- 
ture of sensory input but also by the 
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tonic status of the perceiving organism. 
Sensory and tonic factors not only in- 
teract but are also dynamically equiva- 
lent. 
Werner and Wapner support this last 
contention by citing evidence from the 
recent neurophysiological literature that 
a simple dichotomizing of CNS func- 
tions into sensory and motor processes 
is difficult if not impossible. From the 
psychological literature they review data 
that have been traditionally catalogued 
under the label, “Unity of the senses,” 
and in this connection it should be noted 
that the tension effects they describe 
must be mediated through sensory feed- 
back, either visceral or proprioceptive. 
Their own experimental work demon- 
strates that the perceived spatial posi- 
tion of simple figures, the autokinetic ef- 
fect, and the perception of movement in 
static drawings can be influenced by the 
asymmetrical arousal of muscular ten- 
sion (30, 51, 66). For example, the po- 
sition of the vertical can be shifted left 
or right by electrically stimulating the 
neck muscles on the left or right side, 
respectively. Indeed, though they dis- 
cuss visceral tensions, and suggest that 
their theory provides a means of accom- 
modating the problem of motivation and 
emotion in perception (i.e., the projec- 
tive nature of perception), their obser- 
vations have, for the most part, been 
confined to the relation between tonus 
in the voluntary musculature and visual 
orientation. The notion of pooling of 
effects has already been seen in Helson’s 
adaptation-level theory, and Helson has 
suggested (34) that his quantitative ap- 
proach will facilitate an analysis of sen- 
sory interaction as well as an under- 
standing of its central mechanism. 
Perception in the ethological theory 
of instinctive behavior. The etholo- 
gists, a contemporary group of Euro- 
pean zoologists, have elaborated a the- 
ory of instinctive behavior that includes 
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some Gestaltist-influenced notions about 
the nature of perception (61). 

The term “instinctive response” covers 
a varying complexity of animal activities. 
For example, the simple swimming move- 
ments of the eel and the elaborate mat- 
ing ritual of the stickleback are both 
typed as instinctive behavior. The ap- 
pearance of an instinctive response de- 
pends upon the activation of an “Innate 
Releasing Mechanism.” The ease with 
which this mechanism is activated de- 
pends in turn upon motivation (physio- 
logically, a concatenation of hormone 
actions), internal stimulation, and in- 
trinsic nervous activity. Motivation, 
furthermore, is also susceptible to al- 
teration by certain external conditions 
called orienting stimuli. 

Actual arousal of the I.R.M. depends 
—except in conditions of intense mo- 
tivation when vacuum activities may 
occur—upon the animals being con- 
fronted by certain sign stimuli. The 
male stickleback in mating condition 
must perceive the swollen belly of the 
female before he executes the courting 
dance; the young herring gull must per- 
ceive the red patch on the mother’s bill 
before giving the begging response; the 
young gallinaceous bird must perceive 
the long-necked silhouette of the mother 
before giving the gaping response. These 
stimuli vary in complexity. They also 
have Gestalt character, and cooperate 
according to what Tinbergen calls the 
“rule of heterogeneous summation” 
(61). Thus perceptions, the theory 
implies, are immediately given, and de- 
pend upon species, age, sex, motiva- 
tional state, etc. Even in the instances 
where the ethologists recognize learn- 
ing (they give greatest attention tO 
Prägung, one-trial permanent learning 
that occurs during some critical period, 
usually early in the animal’s life), this 
does not include perceptual learning, 
but rather presupposes, particularly i9 
the case of Prägung, that perceptu® 
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organization is immediately achieved. 
This instinct theory, like all instinct 
theory, requires the assumption that the 
effects of learning are confined to the 
specific area of experience in which 
learning occurs. Riess’ (58) study of 
instinctive” nest building, as well as 
the extensive literature on learning set, 
raises doubt concerning the validity of 
such an assumption. 


IV. Modern Functionalist Views of Per- 
ception 

„ McDougall’s conception of percep- 
tion. The reader may question the ap- 
propriateness of including a brief dis- 
Cussion of McDougall’s view in this 
section. But McDougall was among 
the early protestants against a preoc- 
cupation with conscious content. To 
talk about contents, he insisted, is to 
lead to continual confusion. Percep- 
tion, rather, is an active interpretative 
Process of the mind (53). All human 
beings begin with a capacity for experi- 
encing. After repeated practice in per- 
ceptual thinking, all become capable of 
Increasingly refined and complex dis- 
crimination and, at the same time, of 
a elaboration of the colligation of 
pee signs confronting them. Per- 
it Ption is thus a form of conception and 
ik is impossible, as Adams reiterates 
dist (2), to make more than a verbal 
Hehe between them. McDougall’s 
pli Ussion of cognitive dispositions im- 
ine a basic unity in the organism’s be- 
the a But there is no suggestion of 
heer between specificity and 
he ie in perceptual learning that 
29) ibsons have recently insisted Upon 

pel poasorlirs view of perception as 
of hose, response. In the Woodworth 
ifferen (72), one notes some clear-cut 

Views Ba from the conscious-content 
andi escribed earlier. While some at- 
tent 1S directed toward conscious CON- 
, perception is seen primarily aS 4 


function performed by certain areas of 
the brain. It consists of neurological 
events that follow sensation and precede 
motor response. This neurologizing in it- 
self did not constitute progress over its 
contemporary arm-chair analysis clothed 
in terms of consciousness. But progress 
did lie in the shift in emphasis from 
content to function. Perception is a re- 
sponse and not a picture, an active ana- 
lyzing and interpreting of signals pro- 
vided in sensory stimulation, a knowing 
of the facts. As such, it transcends 
present stimuli and is a necessary ante- 
cedent to overt motor response. As an 
interpretative response it is both com- 
bining and isolating in function, the 
culmination of a process of discovery of 
information about the world. It is sus- 
ceptible to the regular laws of learning. 
This position is made clearer in Wood- 
worth’s 1947 note on conditioning (73). 
What is reinforced, he contends, is the 
perceptual response—something on the 
receptor rather than the effector side. 
Certain views of the world get fixed as 
a result of the consequences they have 
for the organism’s adjustment to its en- 
vironment. Thus the common impres- 
sion that perception is a passive affair 
derives from the fact that certain views 
rovoke almost immediate reaction. But 
when a new perception is in the making, 
it is seen to be a slow trial-and-check 
affair motivated by a basic will to per- 
ceive. 
Here, then, perception is not a pic- 
ture of the world, but a motivated proc- 
ess, part of the organism’s active search 
for adjustment in the world. Wood- 
worth’s view of perceptual learning re- 
sembles, in at least one aspect, that of 
James: the result of continued reinforce- 
ment must certainly be «old-fogeyism.” 
The description of perception as an ac- 
fies Woodworth with 


tive process identi ; 
the transactionalism of the functional- 
ists of the 1950's. But in Woodworth, 


as in James, the notion of a reciprocal 
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relationship between perception and ac- 
tion is not explicit. While perception 
as an antecedent to motor response most 
certainly influences the latter’s charac- 
ter, the alternative possibility is not 
made clear. 

Brunswik’s functionalism. Brunswik 
may be classed among the modern func- 
tionalists in perception. He has left us, 
however, no highly elaborated theoreti- 
cal structure, for his preoccupation was 
with methodological issues. It was his 
conviction that the nature and magni- 
tude of S-R correlations must be estab- 
lished before a theoretical superstruc- 
ture can be safely put up. His func- 
tionalism is indicated at the level of 
theory construction by his concept of 
ecological validity, and at the level of 
methodological implementation by his 
notions of representative design. It is 
also seen in the correlational charac- 
ter of his work and in his interest in 
probability learning—particularly in his 
work on perceptual probability learning 

(18). Here it is clear that he meant to 
relate the very structure of perception 
to pattern of past experience. His work 
on distal focusing and the constancies 
(16) reveals a bias toward veridicality, 
but this he correctly sees as a matter 
of degree, not as one side of a theoreti- 
cal issue. Brunswik’s view is relational- 
organizational in character. His views 
on probability learning and his concern 
with the relation of perception to goals 
and aims resemble aspects of James and 
of Helmholtz. There is an obvious 
similarity between what takes place in 
probability learning and in the develop- 
ment of Bruner and Postman’s families 
of hypotheses (13) and of Cantril’s as- 
sumptive world (21), though Brunswik’s 
notions are probably more precisely for- 
mulated. Finally, we may suggest that 
probability learning is a quantitative 
complement of adaptation-level. 

The transactional approach of Ames 

and Cantril. The view evolved by 


Ames, Cantril, and their students (21, 
42) contains elements similar in some 
respects to a number of the views 
previously described—e.g., James (45), 
Brunswik (17), Woodworth (73), Hebb 
(32), and Helson (34). Indebtedness 
to Whitehead (70) and to Dewey and 
Bentley (23) is expressly recognized. 

Perception, according to the transac- 
tionists, is a process by which an indi- 
vidual person, with a particular outlook, 
attributes significance to events in his 
behavioral environment. These signifi- 
cances are established through the past 
attainment of goals. Put more simply, 
perception is part of the process of in- 
terpreting present stimulus cues from 
the bias of expectation. Expectations, 
or assumptions, are the consequents of 
past experience and action. As they are 
repeatedly confirmed or reinforced (and 
it is appropriate to speak of reinforce- 
ment, since the criteria of evaluation 
are provided by the purposes of the in- 
dividual perceiver), corresponding facets 
of perception become more stable and 
harder to alter—although changes within 
specific situations may be dramatically 
effected by a single failure of confirma- 
tion. 

While there is no detailed treatment 
of the generality of an assumption and 
its strength in a particular situation, 
one may expect that persistent failure 
in confirmation in some particular situa- 
tion may result jn weakening of the 
assumption for other situations, There 
is, this group points out, almost an in- 
finity of physical arrangements (equiva- 
lent configurations) which will produce 
the same pattern on the retina (43). 
These are thus all seen alike, and what 
is seen is what one has been familiar 
with through the transaction of living- 
Rooms may be Constructed in a variety 
of physical shapes. All will have the 
same perceived shape—the cubicle of 
everyday experience. This perception 
can change dramatically, however, whe? 
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the viewer checks his assumptions about 
shape by feeling about the room and 
finding it not to be of normal shape. 
Presumably, if failure of confirmation 
of the assumption of cubeness occurred 
frequently enough, our normal response 
to the equivalent configurations for 
rooms would also change. The totality 
of assumption, strong and weak, that 
the perceiver holds about his surround- 
ings constitutes his assumptive world. 
The physical world must thus remain 
an inference. Perceiving, not a passive 
imprinting of a world picture, is one 
aspect of an active transaction between 
the striving organism and its environ- 
ment. Perception, therefore, is neces- 
sarily the product of learning. It is not 
immediately given by the properties of 
the physical world and receptor organ. 
The demonstrations devised by the trans- 
actionists, when considered together, em- 
phasize the lack of correspondence be- 
tween the physical and phenomenal 
worlds: perception is nonveridical, but 
lt is also stable. It is, of course, not 
exactly appropriate to speak of non- 
veridical perception, since we are not 
dealing with a noncorrespondence be- 
tween the real world and one’s impres- 
Sion of it, but rather with noncorre- 
Spondence between inferences about the 
World described in phenomenological and 
In physical terms. 

_ The transaction theory has been criti- 
Cized for ignoring the possible role of 
Stimulus configuration in the structur- 
Ing of perception (27). But neither the 
theorists nor their critics have created 
a crucial situation for establishing the 
ee of either stimulus configuration 
earning. It has also been contended 
a the transactional view fails to pro- 
f le for perceptual learning of the speci- 

city type. This may be countered by 
painting out that strengthening of ase 
ar aons is a matter of differential re- 

: orcerent, and it is exactly with this 

Chnique that specificity is established 


(e.g, in the conditioned salivary re- 
sponse of the dog to tones). When 
strength of hypothesis is defined in 
terms of probability statements, we note 
the similarity of this view to that of 
Brunswik. When the probabilities are 
described as weighted in terms of rele- 
vance, the possibility of formulation of 
hypotheses in terms of norms, in the 
manner of Helson’s adaptation-level, be- 
comes apparent. Specificity is then de- 
finable in terms of variable error about 
this norm. 

Bruner and Postman and the “new 
look” in perception. The late 1940’s 
and early 1950’s saw a long and con- 
troversial series of experiments directed 
toward testing the functionalist propo- 
sition that perception is an instrumental 
activity serving the needs and wants of 
the individual perceiver. Bruner and 
Postman were leaders in this movement. 
It was their conviction that we perceive 
what we need to perceive and what 
we are used to perceiving (14). Au- 
tochthonous factors were accepted as a 
class of perceptual determinants, and it 
was recognized that there are situations 
where the properties of the stimulus 
configuration leave nothing more to be 
determined in perception. But it was 
maintained that autism in perception 
typically is strong. For example, when 
it was argued that the “new look” ef- 
fects are produced in the laboratory un- 
der conditions of extreme stimulus im- 
poverishment, the reply was that the 
conditions of viewing in everyday life 
(eg, lack of specific expectation, the 
momentary glance, etc.) are conditions 
of stimulus impoverishment. The moti- 
vated perceiver is alert to the presence 
of personally relevant objects and, al- 
though on certain occasions recognition 
of their presence may be withheld from 
consciousness, he recognizes them more 
readily, tends to overestimate their size, 
etc. Thus accentuation, fixation, vigi- 
lance, defense, resonance, and selectivity 
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were presented as explanatory principles 
to take their place along with closure, 
symmetry, proximity, similarity, and 
good continuation. 
The earliest attacks on the “new look” 
approach were criticisms of technique. 
Failures at replication were described, 
and flaws in experimental design pointed 
out (54). But the theoretical formula- 
tion has also been attacked as vague 
and equivocal. Luchins (52) rejects 
the distinction between formalism and 
functionalism, marshaling a great mass 
of data that demonstrate perception to 
be stable and veridical, and insists that 
the importance of rational processes in 
perception has been neglected. Hoch- 
berg and Gleitman (40) object that 
the Bruner-Postman type of theorizing 
makes explanations possible simply by 
elaborating, post hoc, the list of motives 
presumed to underlie perception, Howie 
(41) claims that the theory does not 
enable one to determine when one func- 
tional principle gives way to another, 
and Prentice (57) suggests that to de- 
fine perception in the way that Bruner 
and Postman do is not only to inflate 
it beyond its traditional meaning but 
to reduce it to meaninglessness. 
Bruner and Postman have come to 
recognize their principles (perceptual 
defense, accentuation, etc.) as descrip- 
tive rather than explanatory (13, 56), 
and have proposed a more general ex- 
pectancy-type theory. According to this 
later formulation, perception begins with 
expectancies or hypotheses. Each hy- 
pothesis represents the arousal of a 
complex of cognitive and motivational 
processes by an environmental clue. 
The present stimulus situation provides 
“information” to the expectant organ- 
ism. This is evaluated with expecta- 
tion as the criterion. If the initial hy- 
pothesis is confirmed in some critical 
degree, it is strengthened; if not, it is 
set aside for a new hypothesis. Hy- 
pothesis strength depends on frequency 


of past confirmation, the number of al- 
ternative hypotheses available, and cer- 
tain cognitive, motivational and social 


consequences. Like Berlyne, Bruner and ` 


Postman attempt to account for the ac- 
tive, directive characteristic of percep- 
tion by proposing a mechanism that 
unites its “pictorial” aspects with set. 
If one thinks of hypothesis strength in 
terms of relative frequency of reinforce- 
ment, one recognizes the striking simi- 
larity between this view and that of the 
transactionist. Meanwhile, while ex- 
pectancy is the basic functional prin- 
ciple, it is not clear how it relates, 
within the framework of the theory, to 
other psychological processes like the 
mechanisms of specific motives and of 
the several cognitive processes, e.g., 
memory. Distinction between hypothe- 
ses and other psychological processes 
at the operational level in specific in- 
stances of observation appears to be mo- 
mentarily extremely difficult. Brunswik 
(17), meanwhile, has made a defini- 
tional distinction between perception and 
thought in terms of uncertainty-geared 
and certainty-geared functions. Opera- 
tional distinction was demonstrated by 
presenting a size-constancy task like, on 
the one hand, a problem in direct per- 
ceptional observation, and, on the other, 
an arithmetic-reasoning problem. In the 
first case, responses were compactly and 
fairly normally distributed, and indi- 
cated approximate size constancy. In 
the latter case, there were many on-the- 
dot correct responses, but also many €x- 
treme and bizarre errors indicating Con- 
fusion and the shortcomings of logical 
extension from misinterpreted cues. 
Vernon’s application of the concept of 
schema. Vernon’s theory of perception 
is a functional theory that stresses the 
stability and orderliness of the perceiv- 
ex’s conception of the world (65). She 
is critical of the recent trend towar 
studies of motivation and perception for 
ignoring the intelligent operation of the 
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perceiver’s background of knowledge in 
structuring perceptions, and for overesti- 
mating the transient effects of needs. 
She begins by recognizing that the ex- 
ternal world consists of orderly arrange- 
ments of objects, and that these give 
rise to a stimulation pattern that is or- 
dered. Meanwhile, she rejects Gibson’s 
asumption (26) that ordinal stimulation 
gives immediate rise to the finished per- 
ceptual product. It is here believed 
that perceptions are the end products of 
a slow process of construction through 
manipulation of and experimentation 
with the external world. While admit- 
ting that the slow perceptual develop- 
Ment suggested by Piaget (eg, 55) 
May, to some extent, be attributable to 
Maturation, she insists that specific ex- 
Perience is essential to the perceiver’s 
Conception of himself and the world. 
Through experience is evolved a system 
Pa integrating and interpreting the in- 
ormation provided in stimulus input. 
o vation plays a major role in both 
ie differentiation and elaboration of 
ese classificatory criteria. 
= This theory is, of course, an applica- 
Si of Bartlett’s concept of schema to 
Sie process of perceiving (7). Its re- 
TE pae to Bruner and Postman’s Cy- 
een, hypothesis formation, hypothesis 
* Ait pier and hypothesis reformula- 
tants clear. Unlike Bruner and Post- 
= : theory, however, it stresses the 
sa, €rm orderliness and veridicality of 
i no Long-term orderliness 1S 
seen in the transactionists’ notion 
Bien assumptive world, but the insist- 
a veridicality is missing. Fur- 
com ore, to make the notion of schema 
ie eee to the concept of assump- 
Postulere it would be necessary to 
Schema par some superordinate 
ous pe Sa would integrate the vari- 
aving te pu Bonet (e.g; od 
Proper oo with temporal and spatia 
Cañe S, those dealing with their s18- 
es, etc.) into a single functional 


system. The long time necessary for 
the elaboration of a schema suggests 
that its physiological counterpart might 
be Hebb’s cell assembly. 

While Vernon has taken pains to dis- 
cuss the relation of schemata to both 
perception and motivation, she has not 
made quite clear the properties of 
schemata, defined either in terms of de- 
tails of theoretical structure or opera- 
tions which differentiate these details at 
the empirical level. How are schemata 
differentiated from perceptions, memo- 
ries, motivations? How are some sche- 
mata distinguished from others? How 
are they functionally related? What 
are the functional properties of sche- 
mata—their rigidity, scope, complexity, 
etc.? Until questions like these can be 
answered, we have achieved only a way 
of looking at perceptual phenomena, 
not a theory of perception. 

Allport’s concept of structure. In his 
recent book (4), Floyd Allport evalu- 
ates some 13 perceptual theories and 
formalizes his own views. In all theo- 
ties reviewed he finds two common 
limitations: ineffective handling of the 
problem of meaning, and unwarranted 
preoccupation with quantitative Jaw. 
More basic than this latter, he contends, 
is another type of law, that of event- 
structure. 

Structures are defined through a ge- 
ometry of on-goings and their inter- 
relations at event-points. Time is de- 
fined, not as a linear function, but as 
the duration of events that close them- 
selves through a cycle of operations. 
On-goings are activities distributed both 
spatially and temporally; event-points 
are points of “contact” among “on-go- 
» Structures are not randomly or- 
but may be predicted 
Events, since they 


ings. 
ganized patterns, 
in probability terms. A 
are points of contact, have no dimen- 
sions; they simply occur. Hence loci 
within a structure are event-regions of 
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space through time in which events may 
or may not occur. 

Perceptions are dynamically operating 
structures. Perceptual aggregates re- 
semble Hebb’s cell-assembly-phase-se- 
quence scheme (32), save that Allport 
appears to push the notion beyond the 
microscopic level of the material hypo- 
thetical construct to the submicroscopic 
level of the nonmaterial, purely inter- 
vening variable. Should the theorist be 
able to formalize structures in unam- 
biguous denotable terms, he would be 
able to achieve a fruitful, integrated 
theory. This is an aim that the present 
writer does not see that Allport—despite 
compulsive elaborateness and sometimes 
excruciating logic—has attained. One 
comes away from reading Allport with 
the feeling that he has been concerned 
more with translating the concepts of 
competing perceptual theories into the 
idiom of his own viewpoint and less 
with the construction of a set of sen- 
tences that will provide for the explana- 
tion and prediction of a repertoire of 
specific perceptual phenomena. When, 
for example, the constancies are dis- 
cussed, it is in terms of finding a basis 
for Brunswik’s probabilistic weighting 
of cues in the density of meaning-cy- 
cles and in interstructurance, When 
phenomena are analyzed (e.g., thirst- 
quenching behavior), one is likely to 
feel that Allport has done little more 
again than translate the elements of the 
observational sequence into the nomen- 
clature of his hoop (cycle) analogy. In 
short, his perceptual theory (and there 
is none distinct from general structure 
theory which is applicable to all natural 
events) is so abstract and is stated in 
such broad terms that it contributes no 
more than a general bias—granted that 
this may eventually prove quite fruit- 
ful. Certainly it appears to lack the 
articulation needed in a theory that is 
going to be intimately tied to the work- 
a-day activities of the laboratory. 
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THE LOGICAL STATUS OF CURRENT 
PERCEPTUAL THEORY 


Early conceptualizations of perception 
are marked by a concern with conscious 
content; later views regard perception 
as a class of response, different from 
other classes in accessibility (it is mg 
ferred rather than observed) and in 
function (it is an intermediary rather 
than a consummatory response). Per- 
ception as intermediary response was 
first assumed to stand in a unilateral re- 
lation to consummatory acts; more re- 
cently the relationship has been described 
as interactive. Definitions have been 
framed in phenomenological, physiologi- 
cal, and psychophysiologically neutral 
terms. Perceptions have, on the one 
hand, been equated with simple dis- 
criminations, and, on the other, identi- 
fied as events so complex as to defy 
ready replication. Early use of the 
term included reference to both the 
physical properties of an object and its 
significance; later, primary concern was 
with object qua object; currently inter- 
est has again turned to the meaningful 
object. Earlier studies dealt primarily 
with the encyclopedic task of analyzing 
and cataloging the properties of percep- 
tion, generically defined; modern studies 
are most frequently directed toward 
schematizing the interplay of factors 
that give rise to some particular set of 
perceptual responses. While some cur- 
tent work stresses the autistic proper- 
ties of perception, most of it emphasizes 
veridicality. Early theorists regarded 
learning to be of paramount importance 
in determining perceptual structure; the 
Gestalt theorists of the middle period 
de-emphasized the importance of learn- 
ing, with their powerful demonstrations 
of the organizing factors; modern theo- 
rists have restored learning, particularly 
early learning, to a position of promi- 
nence. Except in modern psychophys- 
ics, attention has been currently di- 
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rected toward independent variables of 
a central rather than peripheral sort. 
go adequacy of current data for 
cory construction. The adequacy of 
available perceptual data for the con- 
struction of a general theory of percep- 
tion has been questioned, for one reason 
or another, A common criticism leveled 
o early work was that the criteria 
or identifying the perceptual response 
R basically individual and inside the 
$ server, This actually meant that the 
Mteria for identifying the data lacked 
[iat and that the status of the 
ie with regard to the requirement of 
a Pechivity was undetermined. Of 
the re all perceptions have in common 
lie act that they can be made pub- 
ein as discriminations, and, un- 
Pa nately, the methodological reaction 
ae introspection has resulted, in 
n cases, in a highly restricted defini- 
die of discrimination. Until recently, 
nae designs have usually al- 
with na only a two-category response 
might but one of the dimensions that 
SA affect the response held constant. 
MaS. the objectivity of the resulting 
Such P beyond question, the value of 
Multidin ene for the creation of the 
curre ae one interactive theories 
signs { y favored is unclear—such de- 
(17) requently lack representativeness 
Cm But before we can achieve data 
a ate with such a view, per- 
Structy must be dimensionalized. The 
banish b of the underlying mechanism 
Provid e defined fully by studies that 
relevant for only the identification of 
necessi Vätiables, although this is a 
t aed first step. This is tantamount 
Proach g that the psychophysical ap- 
One? fees be exploited, regardless of 
Order ees bias, before higher- 
aChieveg eories can be competently 
is cer: tion—a theoretical concept. It 
Dreoccy probable that the long-time 
Pation with conscious content 


resulted from a failure to recognize 
the distinction between perceptual phe- 
nomena and perception as a theoretical 
concept. But as Tolman says (63, p. 
« 

111), “The fundamental essence of any 
hypothesis is not, it would seem, whether 
it be conscious or not but the fact that 
it is a condition in the organism which 
sets the latter, whether human or sub- 
human, for a certain systematic selec- 
tivity of behavior.” 

Logically, perception is a disposi- 
tional concept, and, as indicated by the 
occurrence of controversies like nativ- 
ism versus empiricism, organization ver- 
sus stimulus-compounding, and autism 
versus veridicality, a dispositional con- 
cept cannot be defined by any fixed to- 
tality of reduction sentences. It is al- 
ways possible that more will be added 
as observational situations change. In 
the language of the experimenter, these 
reduction sentences constitute a class of 
operational definitions. Hence it fur- 
ther follows, as Garner, Hake and Erik- 
sen (25), and Bridgman (12) before 
them, have pointed out, no concept can 
be given definitive definition by a single 
operation. Overlooking this fundamen- 
tal requirement has led many investiga- 
tors to the misimpression that opera- 
al validity is, in the final sense, sim- 
ply a matter of arbitrary definition. 
Thus perception has been equated with 
simple discrimination. This View has 
yielded clear-cut experimental results, 
put under such restricted conditions 
that their generality is unclear. Fur- 
ther, it has meant an inability to dif- 
iferentiate between the properties of a 
perceptual mechanism which we infer 
to exist and those of receptor and effec- 
tor systems. Thus distinction between 
sensation and perception is a priori, and 
made purely at the semantic level. (It 
is recognized that there are levels of 
communication—i.e., in an achievement- 
oriented psychology like Skinner’s with 
the white rat—where the distinction be- 


tion: 
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tween central and peripheral mechanism 
is not necessary.) Garner eż al. (25) 
cite perceptual defense as a recently 
promulgated concept that is ambiguous 
because the experiments from which it 
was derived failed to achieve opera- 
tional clarity. But lack of clarity prob- 
ably results also from a lack of ap- 
preciation of the function that the con- 
cept is to perform. (See, for example, 
Gibson’s paper, “What is Form?” [28], 
in which he describes 10 different con- 
cepts of form encompassed in three gen- 
eral categories of definition.) The re- 
marks of the present paragraph are 
not intended to imply that perceptions 
should not be identified with discrimi- 
nation for, as was pointed out ear- 
lier, discriminatory responses are the 
only means by which perceptions are 
made accessible as objective data. But 
the concept, as it has evolved from hu- 
man studies, implies multidimensional- 
ity. Thus, it requires objective defi- 
nition by a system of discriminations, 
as Boring suggests (11), made explicit 
through experimentation, if only such 
experiments are performed. 

A general weakness of perceptual theo- 
ries. Perception as a dispositional con- 
cept depends as much upon assumptions 
associated with the invented mechanism 
as upon the empirical data to which it 
is operationally linked, 

Perhaps the most widespread meth- 
odological defect of perceptual theories 
(and this holds for psychological theo- 
ries in general) is the absence of a set 
of statements that relate propositions 
about the model to reduction sentences 
(what Campbell Picturesquely called 
the “dictionary” [20]). Even where 

the model (e.g., Hebb’s cell-assembly 
model) is delineated in detail, these are 
more or less Jacking. Perhaps a promi- 
nent exception is Köhler and Wallach’s 
formulation of the satiation theory of 
figural aftereffects. Failure to relate 
data systematically to characteristics of 


the mechanism has meant an inability to 
relate various properties of the mecha- 
nism to each other. Thus, it is difficult 
to make a scientifically meaningful dis- 
tinction between sensation and percep- 
tion, and perception and conception, of 
to differentiate operationally between 
the processes of attention and those of 
perception. Unless such theoretically 
grounded, operationally specified dis- 
tinctions can be made there is no justi- 
fication for these terms in the formal 
language of psychology. 

Prospectus. Now that charges have 
been drawn, what must be done to im- 
prove the status of perceptual theory? 
Armed with the equipment of modern 
logical analysis, one might attempt 4 
tour de force, creating a highly ar- 
ticulated theoretical structure, catalog- 
ing and correlating reduction sentences; 
placing assumptions about the model in 
propositional form, and identifying the 
conditional reduction sentences neces 
sary to convert propositions into state- 
ments about data. Such an exercise 
would probably not come off. Science, 
as Conant points out (22), is not made 
that way. Unlike a system of formal 
logic, it is not closed. Rather it is al- 
ways in the status of becoming—of un- 
covering new data and incorporating 
new concepts. 

It may be advantageous, however, tO 
explore our dependent variables, both 
intervening and otherwise, as dimen- 
sions to be scaled. Only after this i 
achieved can the characteristics of 0U“ 
data be tied to the structure of oU 
model thoroughly enough to provide 
high-level explanations. My persona 
preference at this stage of psychology F 
development is for formal constructs: 
These do not commit us to a physiol 
that may some day trap us. (As Adam 
[3] insists, a psychical system may 
just as objective as a reflex arc oF 
cell assembly.) They can be conceive” 
of, first, in simple terms with few 
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mensions, and expanded and made more 
complex as data are acquired, without 
risking the necessity of having some day 
to be totally abandoned in the face of 
Some impressive piece of incompatible 
evidence. And finally, their properties 
are easily conceived of as dimensions to 
be scaled. The possible fruitfulness of 
such an approach is indicated by Hel- 
son’s work on adaptation level. 
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DISCRIMINATION ? 


EDWARD J. GREEN 
Darimouth College 


It has frequently been noted during 
the early phases of discrimination 
training that response probability in- 
creases in the presence of negative 
instances, or S4’s. That is, respond- 
ing in the presence of such stimuli does 
not decrease monotonically in strength 
as is the case in simple extinction. 
Skinner has attributed the temporary 
increase in S4 response strength to 
induction (14). Induction is defined 
as a consequence of there being com- 
mon stimulus properties between S?’s 
and S*’s.2 The greater the similarity 
between the stimulus sets, the more 
induction takes place. The more in- 
duction present during discrimination 
training, the more difficult is the estab- 
lishment of a sharp discrimination. 

One of the major contributions of 
the Bush and Mosteller paper on stim- 
ulus generalization and discrimination 

(2) is the formalization of the proper- 
ties of the discrimination situation. 
To recapitulate somewhat, they cate- 
gorize subsets of stimulus elements as 


1 The writer is greatly indebted to Jie Lr 
Snell and J. G. Kemeny of the Department 
of Mathematics at Dartmouth College for 
assistance and encouragement in the develop- 
ment of the model, 


2 Two conventions will be used 
the remainder of this paper. In place of S? 
and S4, S and S’ will be employed respec- 
tively to facilitate identification of parameters 
in this and the Bush-Mosteller paper. In 
addition, in the treatment of discrimination 
learning, the term trial, as employed by Estes 
et al., is used in a broad sense to include the 
presentation of discriminative stimuli in a 
series. Rate of response has been shown to 
be directly related to probability of response 
as treated herein, and this usage makes evi- 
dent certain possible extensions of the model 

to the free operant situation. 


throughout 


shown in Fig. 1, where S is the set of 
stimulus elements which constitute 
the occasion for reinforcement if a re- 
sponse occurs. S’ is the set which is 
the occasion for nonreinforcement if 
the organism responds. T is the inter- 
cept of Sand S’. It should be noted 
that I exists always in discrimination, 
whether or not the experimenter so 
contrives it. For example, the inter- 
cept includes all uncontrolled stimu- 
lation common to both S and S’ stim- 
ulus sets, in addition to whatever the 
experimenter may or may not have 
contrived experimentally to be com- 
mon to both sets. 

Certain objections can be raised to 
the Bush-Mosteller model in regard to 
the number of constants to be esti- 
mated, and in particular to the state- 
ment that the measure of the intercept 
decreases as discrimination learning 
increases. Clearly the physical nature 
of the stimulus set is unchanged by the 
fact that an organism has come to 
make a discrimination. The present 
development owes much to the Bush- 
Mosteller formalization, but takes its 
roots in the independent though par- 


1 


5 S 


Fic. 1. Set diagram indicating overlap u) 
between stimulus elements present on occa- 
sions for reinforcement (S) and elements 


present on occasions for nonreinforcement 


(S). 
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allel learning model of Estes and Burke 
(5). As in both the Bush-Mosteller 
and the Estes-Burke models, condi- 
tioning and extinction are conceived 
of simply as probability transfers be- 
tween response classes. In this treat- 
ment the additional simplifying as- 
sumption is also made that behavior 
other than that under immediate con- 
sideration is mutually exclusive of that 
behavior. In simple conditioning the 
mean probability of response on the 
nth trial has been defined to be: 


Pa =1-— (1 — p) — 0)". (1) 


Symmetrically, in extinction, proba- 
ility of response is: 


Pa = Po(1 — 80)", (2) 


Where fo is the initial probability of 

response, or operant level; @ is the 

Sampling ratio, or the probability of 

Sampling a particular element; and % 
€notes the number of trials. 


Tur MODEL 


crip asically, one may conceive of dis- 
sete tn as being the simultaneous 
Stim, Ccessive (depending upon whether 
ally) = ate presented in pairs or seri- 
eha Conditionings and extinctions of 
Rican. appropriate to the sets of 
ment a an ployed. ; The present treat- 
€ntati eals primarily with serial pres- 
simte but can be adapted to the 
Priate aneous procedure. It is appro- 
Scribes th observe that Equation 1 de- 
Probab e expected change in response 
Subset a associated with stimulus 
Scribes la and that Equation 2 de- 
Ssocjat, Neoretical changes in behavior 
for the ted with S’—J. Were it not 
model s ntercept elements, the present 
rica oe simply predict symmet- 
Moray oo in S and S’, but the be- 
omp; ic: dynamics associated with 
a Stes a ies picture. 
More p d Straughan (6) developed 
8eneralized function to describe 


the results of sampling from the same 
stimulus set in both conditioning and 
extinction trials. The function is: 


Bn =T — (m — Po) (1 — 0)”, (3) 


where m represents the relative fre- 
quency of reinforced trials in the stim- 
ulus series. One may regard this 
function as representing a special case 
of discrimination, where probability of 
sampling a stimulus element is the 
same whether or not it is in S or in 
S', and where discrimination between 
S and S’ is impossible. Within this 
framework, r is the asymptote which 
response probability approaches. 

8 is defined as the probability of 
sampling a particular stimulus ele- 
ment. In the original paper by Estes 
(3), the sampling ratio was also de- 
fined as the ratio s/S, where s repre- 
sented the number of elements sampled 
on a given occasion, and S represented 
the size of the population to be sam- 
pled. It is assumed that the number 
of elements sampled on a given occa- 
sion, s, remains constant. With 0 so 
defined, it is seen to be determined 
both by organismic and by environ- 
mental factors. For example, should 
the organism sample exactly one ele- 
ment on any given trial, and should 
the population to be sampled consist 
of exactly one element, then 6 = 1.00. 
If, on the other hand, the population 


d consist of twenty elements, the 


houl 1 
waite of 0 would be 0.05. — Thus @ is 


seen to depend upon environmental 
factors. Should the organism suffer 
impairment of its discriminatory appa- 
ratus, one might predict that the num- 
ber of elements which it is capable of 
iven occasion is re- 


mpling on a g! 
ee: Thus the effect upon the value 


of the sampling ratio of impairing the 

discriminatory apparatus — bs 
able to increasing 

Pa ulation. Burke, Es- 


timulus pop 
ra a Hellyer (1) have shown that 
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rate of conditioning is inversely pro- 
portional to signal variability. In- 
creasing signal variability, in other 
words, is one way to decrease the 
sampling ratio. Similar observations 
have been made in a more conven- 
tional discrimination experiment (8). 
Once it is assumed that 0 is a param- 
eter determined both by organismic 
and by environmental factors, one 
may examine the consequences of as- 
signing different values to 0 under 
conditions of reinforcement and non- 
reinforcement. Let7 = 1 — r, 6: being 
the sampling ratio for S elements, and 
62 the sampling ratio for S’ elements. 
Consider a single element in the 
intercept. One can form a Markov 
chain with States 0 and 1. State 0 
means that the element is not condi- 
tioned, and 1 that it is (11, Chap. 7, 
Sec. 5). The transition matrix for this 

chain is: 

0 1 


Of—- 78, Th 
P= 
l Toz 1 — 70: 


The entry pij™ of P” gives the prob- 
ability that the system is in State j 


after n steps if it starts in State 4. 
But 


1 
a TO, + To 
0 1 
0 (702 — TOA”, 7, — TOA” 
x 
1 lire — TOA”, TO + FOA” 


where à = (1 — 11 — #62) is the prin- 
cipal eigenvector for P. 

The mean response probability with 
a single stimulus element is p if 
the element was initially in State 0, 
and pu if it was in State 1. Assume 
now that there are NV stimulus ele- 
ments in the intercept, a fraction, o, 
of which are conditioned ; i.e., in State 
1. Then the mean probability of a 


response becomes: 
Pn = po-pu™ + (1 — Popa (4) 
which is 


2 Th 
701 + 702 


É [ 704 p 
Th + TO. 72 
X (1 — zb, — 702)". (5) 


It can be seen that this last expres- 
sion reduces to Equation 3 when 
9: = 42. It follows that probability 
of response in S is a weighted mean 
of Equations 1 and 5 if we somehow 
take into consideration the fact that 
the intercept elements are presented 
within every stimulus presentation in 
the series, and S—I and S'—I are 
presented with frequency r and #, re- 
spectively. By altering Equation 1 to 


Pa=1-— (1— p) (1—0), (6) 
and Equation 2 to 


Bn = po(l — 03)», (7) 
one accom 


5 plishes the necessary correc- 
tion. 


Now Pa refers to a response 
probability in the presence of the nth 
stimulus, regardless of whether or not 
it is in S or S’. Where m equals 7, 
I is operated upon twice as often as 
either S—I or StI If we define 
Pa(S—I) to be pn of Equation 6, and 
let 5a (I) be 5n as in Equation 5, then 


Pal(S)=kPn(S—D+(1—z)p,(1) (8) 
and, if we let p,(S’—J) be pù, as in 
Equation 7, then peas 
PalS) =k Da(S'—I)+ (1k) p) (9) 
where k denotes the 
ments in S—J, R! 

elements in S’—T, and (1—k) and 
(1—k’) the appropriate proportions © 


elements in I for § and S’, respec- 
tively. 


proportion of ele- 
the proportion O 


—.2 tad 
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D 
PROPERTIES OF THE MODEL 


The present model predicts the char- 
acteristic increase and decrease in S’ 
response probabilities under certain 
special conditions. If jo is greater 
than 0.00 and less than the asymptotic 
probability for intercept elements, and 
if the total number of elements within 
the intercept constitutes an appreci- 
able proportion of the total in S’, then 
the model generates the S’ peak. The 
increase in S’ response probability is 
alan when the conditioning to the 
ie Roe is more rapid than 
ae bap to the aT elements. 
hee the asymptote for intercept ele- 
oe n is approached closely enough 
Hien a rate of conditioning decreases, 
an extinction to S'—I elements be- 
ae evident, and S’ response proba- 
ty le to reverse and decline 
Bi ae value. The model pre- 
Biter hat no initial increase would re- 

ee were equal to or greater than 
10, + FO, , since p,(.S’) would decrease 
Cog Onically, Likewise, if po were 
à pen ) would rise monotonically 
ome terminal value. 
Since 


Pals) = 1 = eN 
1 2 


0 
Mal ) 
us (= + 762 (10) 


wh Na+ N-n | 
mente A ) is the number of ele- 
Of eleme ~and No is the number 
alter e in J, one can empirically 
and va ombe of elements in S—- 
fal asymp! and, by evaluating emp!” 
o a determine the extent 
Weighted uncontrolled stimulation is 
tio zy within Z. Such a specifica- 
theo akes contact with information 
» Since intercept elements CON- 
nie in discrimination. 
Pidity with which the increase 


Sti tute 


10. 


y 


s o vs 20 25 30 35 40 45 


„Fic. 2. Effects in appearance and mag- 
nitude of the S’ peak as a function of the 


value of the slope constants. 


in pa(S’) occurs, when it does occur, 
depends upon the slope parameters 
(1-02) and (1— T01 — 702). As shown 
in Fig. 2, the smaller the 0 values, the 
more delayed will be the appearance 
of the peak within the series, and the 
greater will be the rise in the peak. 
One final comment is perhaps due. 
The term stimulus induction has been 
used synonymously with stimulus gen- 
eralization (9). It seems reasonable 
to propose that induction be reserved 
in the present analysis to refer to the 
dynamics of successive conditioning 
and extinction within the intercept. 


This leaves generalization to the more 
static conditions treated in the Bush- 
Such a definition 


Mosteller paper. 
seems more in line with Skinner’s orig- 


inal usage of the term. 


DISCUSSION 


Since @ values depend both upon 
mic and environmental factors, 


lation of either of these has 
Specifically, if 


organis 
manipu 
corresponding effects. l 
the sampling ratio is reduced, either 
by increasing the effective stimulus 
field or by interfering surgically wi 
the organism’s discriminatory app? 
ratus, the S’ peak will be delayed in 
he stimulus 


its appearance within t 
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Fic. 3. Displacement of empirical S’ peaks 
resulting from reducing the total number of 
stimulus elements to be discriminated. 


series. On the organismic side, Dr. 
Karl H. Pribram, after inducing tem- 
poral lesions in rhesus monkeys, re- 
ports both the expected delay in the 
appearance of the S’ peak and the 
expected differential in magnitude of 
the peak in the behavior of operated 
animals as compared with normals. 
Data from previous studies (7, 8) sup- 
port predictions of the model concern- 
ing the appearance of the peak as a 
function of manipulations of the stim- 
ulus field. Figure 3 shows the effect 
upon the position of the peak of in- 
creasing the sampling ratios. The in- 
creased sampling ratio was produced 
by reducing the total number of ele- 
ments to be sampled in S’; the effect is 
an earlier appearance of the peak. 

An experiment was performed ex- 
plicitly to evaluate the reality of the 
model’s predictions with regard to 
presence and absence of the peak.’ 
Ten subjects were run in each of two 
groups. In each group an equal num- 
ber of cards with S and S’ patterns 
were shown the subjects. The sub- 
jects were required to state when they 


3 As reported in his unpublished manuscript 
entitled, “On the neurology of thinking.” 

4 Acknowledgment is due E. P. Stichman 
who conducted experimental work in the early 
phases of this study. 


believed an S pattern was being pre- 
sented ; they were informed when they 
were correct. The series of patterns 
was shuffled between experimental 
runs, so that each subject was exposed 
to a different series of patterns. The 
patterns were composed upon cards 
upon which were drawn ten circles in 
two lines of five circles each. In 
Group I, one particular circle was 
always filled in if the pattern was an 
S. A different circle was filled in to 
define S’ instances. Three other cir- 
cles, comprising the intercept, were 
filled in in various combinations of 
one, two, or all three; and these were 
presented equally often with either S 
or S’ cards. In Group II, the S and 
intercept circles were filled in, as in 
Group I, but S’ of this latter group 
consisted of the remaining six circles, 
one of which was always filled in. In 
brief, the technique is the same as has 
been used in a free operant situation.® 
Under both conditions, the subject 
is faced with a guessing problem at the 
outset, hence an estimate of .5 for Po 
seems not unreasonable. The model 
makes explicit determinations of sev- 
eral constants for these experimental 
conditions, as is shown in Table 1. 
Consequently it remained to esti- 
mate but a single parameter; either 01 
or bz in the Group II condition, since 
both @: and 02 of Group I are equal to 


TABLE 1 


DETERMINATIONS OF THEORETICAL CONSTANTS 


Constant 


Group 
= Ot 
S|) | oe) ek | we |) Ae 


g -18 | .18 | .50 | .25 | 25 .50 
I | .18 | .03*| .50 | .25 | .67 .86 


* Estimated empirically. 


* A fuller report of this experiment is being 


prepared by the writer, 


: 
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A, of Group II; and since all of these 
are six times as great as 62 of Group II, 
62 of Group II was estimated to be .03. 
Two principal points were examined 
in this experiment: one being the pre- 
diction that the equal @ case should 
generate S and S’ curves which were 
symmetrical about Jo; Sand S’ curves 
in the unequal 6 case should, however, 
be characterized by a more rapid ac- 
celeration to S asymptote than is true 
in the equal 0 case, and an S’ peak 
should occur which should be followed 
by a slow decrease to S’ asymptote. 
rik acy 4 demonstrates the gross re- 
ation obtained between theoretical 
curves and the data gathered in this 
experiment. 
ee second prediction was that, aside 
e m the gross characteristics of the 
aoan reasonably accurate predic- 
bi ne concerning exact response proba- 
‘he ies should be rigorously made by 
= model. The theoretical distribu- 
on for the curves is not known, so a 
x* test of goodness of fit was per- 


toy 


GroupI 


Ss 
s 


id Jo 
n 


ntal Theoretical z d experi- 
w] curves an 

Vhere 9 Sults where 6; = 0» Group Í and 
1 = 68, Group II. g 


formed. In Group I the levels of con- 
fidence for Sand S’ were (.6 < p < .1) 
and (.7 < p < .8), respectively. In 
Group II they were (.9 < p < 1.00) 
for both curves. Admittedly the fit 
is best for the condition from which 
the estimate of bz was taken, but fit- 
ting all four curves through the esti- 
mation of one parameter with the 
degree of precision that was attained 
in this experiment is encouraging. 
Certainly the model is distinguished 
from others in the uniqueness of its 
predictions concerning a given experi- 
ment. 

In examining what happens in the 
experimental setting, one observes ac- 
tivity reminiscent of Kantor’s analysis 
of a response within a behavior seg- 
ment (10). Kantor has defined a re- 
sponse as being composed of three 
reaction systems: the attending, the 
perceiving, and the consummatory. 
Explicitly, in this and similar experi- 
ments, the subject is required to orient 
toward, or attend to, the stimulus pat- 
terns, to respond visually, and to emit 
some operant response. In the model, 
both conditioning and extinction are 
regarded as probability transfers be- 
tween an observed class of behavior 
and some mutually exclusive classes 
of behavior, without examining the 
nature of the non-observed classes. 
Asconditioning takes place, competing 
behavior decreases in strength. As 
extinction occurs, competing behavior 
takes the place of the experimentally 
measured class. The kernel of the 
argument is that during extinction, as 
competing behavior increases 1n prob- 
ability, the competing behavior carries 
with it its own postural or orienta- 
tional aspects. These changes in orl- 
entation, or physical position of the 
subject, systematically increase the 
relative size of the S'—I set, thereby 
reducing the probability of sampling 


an S’ element. This has the effect of 
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distorting estimates of both the sam- 
pling ratio and the constants of pro- 
portionality. 

It logically follows that a corre- 
sponding distortion should occur in S 
or conditioning trials. The value of 6 
should increase as conditioning pro- 
gresses. Estes has already indicated 
that this is the case (4). Hence the 
more rapidly conditioning progresses, 
the more rapidly will it progress. This 
may have some relevance to analyses 
of “learning sets.” At the same time, 
if extinction is rapid at first, it may be 
expected to interfere with itself, so to 
speak, since sampling efficiency is 
reduced. 

The second distortion so introduced 
is in the constants of proportionality, 

kandk’. Initially, uncontrolled stim- 
uli are present as a part of the inter- 
cept. As the systematic changes take 
place, as described above, which trans- 
late much of this uncontrolled material 
into S’—J, I is reduced relative to 
both S and S’. It is here that some 
behavioral meaning can be given the 
Bush-Mosteller statement about the 
diminishing intercept. Such a dis- 
placement tends to exaggerate the de- 
parture of the S and S’ curves from 
Po; thus an analysis is possible which 
accounts for the occasional instance of 
near-perfect discrimination in a man- 
ner which is preferable to some others 
which have been invoked to account 
for this phenomenon elsewhere. The 
discrimination operator D in the Bush- 
Mosteller model is clearly a mathe- 
matical device introduced with little 
rationale other than the fact that it 
produces curves which look more like 
discrimination curves. Restle’s adap- 
tation model is also designed to handle 
this problem, but it unfortunately in- 
troduces more elaborate assumptions 
concerning the conditioning process 
(12,13). The present analysis would 
seem preferable since it involves no 
more complex assumptions concerning 


the conditioning process than does the 
original Estes-Burke theory, and since 
exact experimental tests of its correct- 
ness are immediately evident. 

If orientational changes do in fact 
distort the parameters k and k! as a 
function of trials, they should do so in 
a given experimental setting to a 
greater or lesser degree, depending 
upon the number of experimental ele- 
ments actually manipulated within 
the three subsets. That is, the fewer 
the number of experimental elements, 
the greater is the relative contribution 
of the uncontrolled stimulation and 
the greater the distortion produced in 
the constants of proportionality when 
the uncontrolled elements shift. The 
more elements existing within the sets, 
the less is the relative contribution of 
uncontrolled stimulation, and the less 
will a shift distort k and RY. In the 
experiment presented above, the great- 
est distortion does indeed exist where 
the fewest elements are involved. 
While this is only suggestive that the 
Present analysis is correct, current ex- 
perimental work is under way to eval- 


uate this phase of the problem in 
detail. 


One technique to red 
tematic distortion has b 
by Estes, where intertri 
increased. One migh 
this provides ample 
the subject to sampl 
the environment, suc 
tematic sampling in 
little relative effect, 
orientational changes as 
ducing, this reduces to the analysis 
above in that the total number of 
stimuli is increased, Certainly a map- 
ping of the response space is a sorely 
needed development in further enter- 
prises of this kind. 

The alternative to this analysis 
would seem to involve the intro- 
duction of additional complicating 
assumptions; while any number of 


uce the sys- 
een proposed 
al intervals are 
t reason that 
opportunity for 
e everything in 
h that any sys- 
S’ would exert 
In regarding 
stimulus pro- 
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mathematical devices can readily be 
assumed which more or less solve the 
curve-fitting problem, the experimen- 
tal approach remains the more satis- 
fying. 

SUMMARY 


A particular examination of the S’ 
peak from the standpoint of an exten- 
zon of the Estes-Burke statistical 
earning theory supposes the peak to 
ee eated by simultaneous additive 
Do subtractive processes working 
a response probabilities associated 
ae oe elements within and 
som the intercept of the S and iS 
Reads s. Certain exact experimental 
oe pas are made and examined, 
na aos questions concerning its use 
are a oping a program of research 
a as it compares with other 
availabl approaches. In the light of 
ated i evidence, extensions are sug- 
a, or broadening the scope of the 

analysis. 
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THE PSYCHOLOGICAL REVIEW 


SENSATION AND PERCEPTION IN AN 
OBJECTIVE PSYCHOLOGY* 


C. H. GRAHAM 


Columbia University 


This paper will take up what might 
have been called, in an earlier day, a 
systematic treatment of the topics of 
sensation and perception. In particu- 
lar, it will consider their places within 
the formal structure of an objective or 
behavioristic psychology. 

The development of such a psychol- 
ogy has been a major historical occur- 
rence of the last 35 years, and, in some 
respects, it has achieved a considerable 
level of maturity. Despite the fact that 
behavioristic descriptions of perception 
were advanced at a relatively early date 
(see, for example, 9) the area of per- 
ception still remains a difficult formal 
problem for systematization within the 
framework of an objective psychology: 
The reason for the refractory nature of 
the subject probably lies in the com- 
Plexity of its relations as contrasted 
with the deceptive familiarity of its 
terminology. In any case the general 
topic requires analysis. Specifically this 
Paper will discuss the following five top- 


1The preparation of this manuscript was 
Upported by a contract between the Office of 
‘aval Research and Columbia University. 
€production in whole or in part is per- 
Mitted for any purpose of the United States 
mie ernment. The account encompasses, with 
nor modifications, the content of the writers 
Presidential address to the Eastern Psycho- 
Pee Association at Atlantic City on Mar 
» 1956, 


ics: types of stimulus-response relations 
encountered in perception, the problem 
of the independent statuses of sensation 
and perception, the problem of descrip- 
tive and explanatory terminology, some 
recurring considerations in perception, 
and perception and psychological theory. 

It is hoped that the discussion of 
these seemingly miscellaneous topics 
may take on the character of an inte- 
grated account. In all of this we shall 
not deal with explanatory principles as 
they relate to specific processes. We 
know that the area of perception is rich 
in such principles, particularly as re- 
spects vision and audition. What will 
be done is to consider some general 
characteristics of the large over-all area 
and its relation to other fields of psy- 
chology. The attempt will be supple- 
mented by brief references to a sym- 
bolic notation, in terms of stimulus and 
response, that has been discussed previ- 


ously (3, 4, 5, 6). 


Types OF STIMULUS-RESPONSE 
RELATIONS IN PERCEPTION 


In analyzing the sort of behavior that 
can be encountered in experiments on 
sensation and perception, let us start 
with a particular type of experimental 
situation, one that applies to the experi- 
ment dealing with introspective reports 
of sensations or perceptions evoked by 
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particular objects or events involving 
the objects. Concretely, the question 
arises: What do I have to do in this 
situation to get a subject to give me a 
phenomenal or introspective report in 
the presence of certain objects that I 
supply? 
Proper procedure in such a situation 
demands, of course, that I instruct the 
subject to give me a phenomenal or in- 
trospective report and that I supply the 
appropriate objects in their appropriate 
relations. The consequent behavior on 
the part of the subject will be the emis- 
sion of a sequence of words—words that 
are, we presume, functions of the sub- 
ject’s past history of reinforcements (by 
parents and other persons) in a se- 
quence of discriminative trainings as 
discussed, for example, by Skinner (15). 
These verbal responses are often said 
to be “descriptive” of stimuli. In situa- 
tions such as the one described, the ex- 
perimenter hears such words given by 
the subject as moving, color, short, 
white, object, large, red, duration, size, 
movement, intensity, texture, and a 
whole gamut of expressions that “ap- 
ply” to the character of the stimuli. 
Since these words were conditioned in 
response to aspects of stimuli that are 
now present, they are “appropriate” re- 
sponses, both to the subject and the 
similarly conditioned experimenter. In 
all of this, the place of the instruction 
stimuli in restricting the Possible num- 
ber of responses remains to be under- 
stood. To say that they provide the 
necessary “sets” is not very informative, 
It would be of considerable interest 
to compare a subject’s behavior under 
. instructions to introspect or give a phe- 
nomenal description with behavior un- 
der instructions to describe objects, 
Certainly the subject may manifest dif- 
ferent behaviors as exemplified in “the 
subjective attitude” and the “objective 
attitude? To what extent differences 
are apparent in his behavior is a ques- 


tion on which not very many relevant 
data exist at present. While the prob- 
lem should not deter us, it may have 
later implications for a behavioristic ac- 
count of perception. 

In experiments of the sort under con- 
sideration—that is, experiments on phe- 
nomenal appearances or introspective 
reports—it is an empirical fact that the 
instructions used do not greatly restrict 
responses. The latter may exhibit a 
great many degrees of freedom—that is, 
many different words may be spoken. 
In addition, neither stimuli nor re- 
sponses are usually measured in the ob- 
servations on phenomenal appearances 
or introspection. 

The problem of analyzing and classi- 
fying the responses in an experiment of 
this type is, of course, one of the major 
problems of psychology. In particular, 
the problem of analysis and classifica- 
tion of responses does not involve ac- 
cepting them as understandable con- 
versation, but is one of formulating 
their uniformities and tules in a system 


where they are taken as behavior. This 


sort of analysis provides the anomaly 


that conversation is not viewed as such, 
but is considered in terms of other 
S—that is, scientific 
formulates the rules 


it, One that, as contrasted with 
t on Introspection, shows 


ntifically at a some- 
of development than 
periment, due to the 


stimuli and respo 
lated and measure 


course, this type of, 
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other. In this situation the type of rele- 
vant procedural question is: What do I 
have to do to get a subject to say, with 
a certain degree of probability, “I see a 
light,” or its converse, “I don’t see a 
light”? The subject, in the course of 
any single stimulus presentation, can 
only give one of the two possible re- 
sponses (or under appropriate condi- 
tions, their “equivalents,” Yes or No). 

This type of experiment is the classi- 
cal psychophysical experiment. It turns 
out, in this situation, that light inten- 
sity must be manipulated, and we can 
plot some measure of response—for ex- 
ample, its probability of occurrence— 
against some appropriate function of in- 
tensity. This procedure has been fol- 
lowed in Fig. 1. 

This figure gives the psychophysical 
data of three subjects for an experiment 
On quantum requirements at threshold 
by Hecht, Shlaer, and Pirenne (7). 
Such a function is a stimulus-response 
relation that shows how a measure of 
response varies with a controlling stimu- 
lus variable. From such a relation it is 
Possible to obtain a new datum, a value 
of the stimulus variable that corre- 
sponds to a given probability of re- 
sponse occurrence—for example, -the 
50% value. This new and extremely 
important datum is familiarly known to 
us as a threshold. 

The two categories of the experiment 
that have been discussed, the phenome- 
nal report and the psychophysical, lie 


s zo 


ob- 
and 


t Fic. 1. Frequency of seeing functions 
ained on three subjects by Hecht, Shlaer, 
irenne (7) in their experiment on quantum 

Tequirements for visual threshold. 


at the extremes of an array of psycho- 
logical experimentation, the array be- 
ing ordered on the basis of degree of 
stimulus and response restriction. The 
introspective experiment involves rela- 
tively unrestricted stimuli and responses, 
while the psychophysical experiment 
usually involves a high degree of re- 
striction in both. 

Other types of experiments take up 
positions between the extremes of such 
a continuum. One may have in mind 
particularly the type of experiment in- 
volving absolute judgments, estimates, 
or naming behavior. This type of ex- 
periment deals with the relation be- 
tween environmental stimulus variables, 
often measurable in some theoretical 
numbering system taken, for example, 
from physics, and large families of vari- 
able responses. This is the sort of ex- 
periment that involves such procedural 
questions as: What do I have to do to 
make a subject say “I see five dots,” or 
“This is red,” or “This has a value of 
three on a scale of one to five”? The 
response given in this experiment is one 
that has developed during the subject’s 
life history in the presence of objects. 
“Naming a color” and “giving a numeri- 
cal estimate of brightness” are examples 
of such responses. 

Figure 2 gives a set of hypothetical 
curves for color naming in the spectrum. 
Each curve gives, as a function of wave 
length, the frequency of occurrence of 
the response that applies to the curve.” 

As contrasted with the method of 
constant stimuli with two categories, 
where only two responses can be elicited 


2The curves of Fig. 2 were not drawn on 
the basis of data. They were constructed on 
the basis of “reasonable guesses” relating to 
other kinds of color discrimination. Since this 
paper was written, Aleeza C. Beare, working 
at Columbia, has made experimental deter- 
minations of color naming functions at two 
levels of intensity. The functions are, in fact, 
quite similar to the hypothetical ones shown 
in Fig. 2. 
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Fic. 2. Hypothetical curves for frequency 
of occurrence of color names in various parts 
of the spectrum. 


no matter what the value of the dis- 
criminative stimulus, the method of ab- 
solute judgment allows for the produc- 
tion of a great number of responses. 
For example, in the naming of a color, 
a considerable change in wave length 
can result in many changes in response. 
Many problems beset our understanding 
of this sort of behavior function. In 
particular, we cannot specify, as yet, 
quantitative continua that represent 
relevant amounts of differentiated re- 
sponses. 

A number of undesirable ways of 
talking about the results of the experi- 
ment on absolute judgments hinge on 
this problem. Here one may mention 
two examples. First, there is the de- 
vice, harmless enough if we keep in 
mind what is being done, that has some- 
times been used to imply response con- 
tinuity. This is the device of assuming 
a response continuum as the correlate 
of physical dimensions of the stimulus, 
as when we say that “orange” (rather 
than the wave lengths that evoke the 
response “orange”) is next to “red” in 
the spectrum (rather than the wave 
lengths that evoke the response “red”). 

Secondly, there is the procedure of 
treating numerical estimates as if they 


were numerical data. For example, the 
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word fen emitted by a subject in an ex- 
periment on estimated brightness may 
be treated, erroneously, I think, as if it 
were the quantified outcome of a meas- 
uring operation; the symbol for re- 
sponse (accepted by the experimenter 
as a number) is added to the list of 
other numbers of the same sort, and the 
experimenter says that an “average” re- 
sponse is finally computed. No inde- 
pendent operations of measurement 
have been applied under these circum- 
stances to the subject’s responses. 

In any case, it should be pointed out 
that we can theoretically derive critical 
values—threshold values, if you will— 
from certain types of experiments on 
absolute judgments. For example, we 
can find, on the average, how many 
dots are required for the response five. 
We can also find out how many are 
required for the responses four, three, 
two, one, etc. Similarly, in the case of 
estimates of brightness, one can find the 
value of luminance that is on the aver- 
age correlated with the response ten, 
for example. We do not find what the 
average response is for the same lumi- 
nance. 

The concept of threshold, i.e., a criti- 
cal value of stimulus Corresponding to 


a given response criterion, to which 
previous reference has been 


made, need 
not be elaborated here; it is an old 
and pervasive principle of ps chology. 
Nevertheless, Pee 


t it will not be out of place 
to point out that, by virtue of this 


quantity, it is possible to treat as a 
single datum the outcome embodied in 
a total psychophysical function, and so 
derive what might be called perceptual 
functions—that is, functions obtained 
from a sequence of experiments con- 
cerned with finding thresholds under 
many different conditions. Such experi- 
ments pose such procedural questions as 
the following: What do I have to do 
to get a subject to say “This light is 
brighter than that one” under condi- 
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tions where I systematically vary the 
standard light? 

Figure 3 shows such a sequence of psy- 
chophysical curves obtained by Mueller 
(14) in his determination of the just 
discriminable increment in intensity at 
various intensity levels. Each curve 
gives a threshold AJ, the critical value 
of AZ taken as corresponding to the 
50% response occurrence. 

In a further extension of the data, we 
determine how the second-order datum, 
the threshold AJ, varies with changes in 
the standard light. The function ob- 
tained in these circumstances shows how 


FREQUENCY — % 


FREQUENCY — % 


(“frequency of seeing’ 


determined values of AI (each one based 
on a psychophysical function) vary with 
the intensity of the standard light. 

Figure 4 represents such a function. 
It is the curve for Mueller’s threshold 
values of AJ as a function of the back- 
ground intensity to which AJ is added. 
Such a function, which we may term a 
perceptual function, shows how a criti- 
cal value of stimulus, the threshold, 
varies as a function of a controlling 
variable. 

It is of interest to examine the pres- 
ent analysis in terms of a symbolic for- 
mulation that has been used (Graham, 
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Fic. 4. Intensity discrimination ratios 
(AI/I) at various levels of intensity based on 
the critical AJ’s obtained by Mueller (14) at 
the 50% frequency of occurrence values in 
the curves of Fig. 3. 


3, 4, 5, 6) to specify some classifica- 
tions of relations among stimulus, re- 
sponse, and conditions of the subject. 
Specifically, the variables of the present 
analysis must be examined to see if they 
can be described as special cases of 
the general relation 


R=f(a,6,¢...n...t...%,9,2) [1] 


This relation expresses the assumption of 
a behavioristic program: response is a func- 
tion of certain specifiable variables. In 
particular, the first letters of the alphabet 
(a, b, c...) refer to properly specified as- 
pects of stimuli; the last letters (.. am, A 
z), to properly specified condition of the 
subject (physiological and inferred, includ- 
ing the effects of instruction stimuli); R, 
to response; 7, to number of presentations; 
and ¢, to time. The terms are not always 
independent of each other (Graham, 6, p. 
62). 
It is important to realize that x, y, and 
z may be, among other things, the con- 
sequences of past stimulation. 

The stimulus-response data of a psy- 
chophysical experiment can be repre- 
sented, at least for the cases of the 
methods of constant stimuli and limits, 
by the equation: 


R = F(a, %1) [2] 


This equation is a special case of Equa- 
tion 1: response is a function of an as- 
pect a of the stimulus, with x, the effect 


of instructions, explicitly specified as 
constant. The same sort of analysis 
can be applied to the data of absolute 
judgments. Here it must be admitted, 
parenthetically, that to a considerable 
extent the discussion here presented de- 
pends for its validity on the choice of 
the methods of constant stimuli and 
limits, both with extensive series of ob- 
servations. When the method of ad- 
justment is used, response measurement 
becomes difficult. 

A combination of psychophysical ex- 
periments provides the data for a per- 
ceptual function as in 


ao = (b, x1, Ri) [3] 


where a critical aspect of stimulus āo is 
represented as a function of another as- 
pect of stimulus b. In this case xı, the 
effect of instruction, and Ry, the cri- 
terion response, are maintained constant. 
In other experiments, a could be deter- 
mined as a function of such variables as 
n, t, or y. 

Equations l, 2, and 3, and similar 
ones give a convenient classificatory ac- 
count of the types of relations found in 
the psychophysical experiment, What 
about the relations of the experiment on 
phenomenal appearances? Is there a 
distinction between the data of the in- 
trospective and psychophysical experi- 
ments? Certainly there seems to be in 
two respects. First, both the stimuli 
and responses of the introspective ex- 
periment are relatively unrestricted. 
They depend for their occurrences upon 
the behavior of a subject who is re- 
sponding and being stimulated in many 
different ways. Secondly, neither stimuli 
nor responses are measured in the in- 
trospective experiment, and so a quanti- 


tative description of stimulus-response 


dependence is impossible. While it is 
true that stimuli and responses are pre- 
sumably related from a rigorous mathe- 
matical point of view, the statement has 
little utility in the absence of knowl- 
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edge of the specific relation. Neverthe- 
less we can use the reports of such an 
experiment at a conversational level to 
provide us with hypotheses to be tested 
in the more formalized psychophysical 
experiment. We may also examine the 
reports themselves with a view to ana- 
lyzing them in the context of a descrip- 
tive or explanatory system of language. 

It is to be hoped that we shall not 
long remain ignorant of what the sub- 
ject is doing in the introspective experi- 
ment, and, in fact, in any circumstance 
in which he talks. 


THE SEPARATENESS OF SENSATION 
AND PERCEPTION 


Up to the present time nothing has 
been said about differences that may 
exist between sensation and perception. 
For all practical purposes the two top- 
ics have been considered to be one. 
The question now arises: Can they, in 
fact, be differentiated on the basis of 
appropriately selected criteria? 

Sensation and perception were ac- 
Cepted as different in the general frame- 
work of a structural psychology. Sensa- 
tions were taken to be the irreducible 
elements of consciousness; and percep- 
tions, the carriers of meaning, were 
Judged to be aggregates of sensations 
and images (Titchener, 16). Expo- 
nents of phenomenological description 
took another tack. They regarded their 
Particular interest as concerned with the 
Phenomenal world or the phenomenal 
field (cf., e.g., Gibson, 2; Katz, 10; 
MacLeod, 13), depending on the dif- 
erent attitudes taken in observation. 
mA course, Gestalt psychologists have, 
eet their beginnings, disclaimed the 
meeene of sensation in toto; they deal 

ith stimulations and their consequent 
Patterning (cf. Koffka, 11; Köhler, 12). 
by te the great differences exhibite 

il hese various psychologies in the de- 
exp „Of their treatments of conscious 
Xperience, they do demonstrate, at 


least superficially, an emphasis on the 
importance of the patterning of units 
in perception—units that are variously 
conceived to be sensations, stimulations, 
or cues. 

In addition to their usage within the 
context of psychologies of experience, 
the terms sensation and perception are 
sometimes used in a different, possibly 
unwarranted manner. In consequence 
of their preoccupations with the nature 
of stimulation, psychologists often now 
use the term sensation to include what- 
ever physiological theory and data of 
sensory processes may seem to be ap- 
plicable in a given circumstance. Sen- 
sation has come to be thought of as 
essentially physiological, whereas percep- 
tion, which is relatively poor in physio- 
logical theory, is considered to be psy- 
chological. 

In view of the long historical back- 
ground of the terms sensation and per- 
ception, one may legitimately ask: Are 
either or both terms needed in an ac- 
count of behavior? It must be admitted 
that one could use many terms as con- 
venient classificatory devices in discuss- 
ing behavior. For example, the term 
discrimination might be used to cover 
the class of functions that are of interest 
here. It is probable, however, that the 
latter term would usually be interpreted 
in a more restrictive sense than would a 
term that covers the array of functions 
here discussed, characterized as they 
are by the fact that response is treated 
as a function of stimulus, with condi- 
tions of the subject due to past re- 
inforcements at constant or limiting 
values. (Of course this statement does 
not mean that we are not interested in 
past history parameters! In the latter 
circumstances, response is treated as a 
function of conditions of the subject 
with stimulus constant. But of course 
the latter descriptions have their own 
designation: learning or conditioning 
functions.) 
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In any case, at least for the present, 
the term perception will be retained to 
refer to classes of stimulus-response 
functions that represent constant values 
of reinforcement parameters. Does a 
basis exist for retaining the term sensa- 
tion independently of perception? 

Let us examine the problem. The 
behavior observed in all perception 
experiments (introspective, phenomeno- 
logical, or psychophysical) involves 
stimulus discriminations and response 
differentiations. The correlated terms 
in the observed stimulus-response func- 
tions are the experimental data. In ex- 
periments that provide such data, an 
instruction stimulus is given, ideally 
only once; thereafter, a stimulus as- 
pect is changed and certain responses 

are observed. Variations in a con- 
trolling parameter of the stimulating 
circumstances give rise to correlated re- 
sponse occurrences or changes. This 
context seems to cover all of the func- 
tions classifiable under the heading of 
perception, with no surplus functions 
requiring a new heading. Thus, on the 
side of description of experimental op- 
erations, it seems that no additional 
term is needed. It is true, of course, 
that the needs of theory cannot be 
finally prejudged. It might be that 
some theoretical analysis could provide 
appropriate criteria for the use of two 
or, in fact, more terms to specify proc- 
esses of different orders of complexity. 
So far the use of two terms does not 
seem to be based on mutually exclusive 
criteria. We seem to do well enough 
with one. 


THE PROBLEM or TERMINOLOGY 


The analysis of experiments in per- 
ception and their related concepts pose 
an important practical and theoretical 
question: Shall we dispense with many 
of the terms of an earlier vocabulary 
and try to develop new terms for a be- 
havior science, OY shall we define new 


meanings for the old terms? It may be 
too early to tell whether an account of 
behavior can more advantageously use 
old terms, appropriately respecified, or 
new terms, divested of certain histori- 
cal contexts. In any case, it will prob- 
ably do no harm to see what considera- 
tions might arise from the use of one 
set—specifically, the old terms. Such 
an examination does not now imply an 
ultimate preference for one system as 
opposed to the other. It may, how- 
ever, be supposed that a sequence of 
such examinations would at a later time 
provide us with a basis of choice. 

Let me take some examples from the 
field of color. In this area it might 
conceivably be possible to get along 
without such words as color, brightness, 
Saturation, or hue, provided that we 
could specify the end terms of a given 
discrimination. Experience, however, 
shows that, even if we could always (as 
we cannot) express uniquely the energy 
relations that correlate with a given re- 
sponse (and vice versa), the use of such 
terms would be cumbersome. For this 
reason we shall here consider the roles 
of such words as hue, saturation, bright- 
ness, color and chromaticness. We shall 
not argue the question of whether these 
names are labels, intervening variables, 
or hypothetical constructs. The appli- 
cable terms will probably be chosen on 
the basis of the theorist’s attitude, 

Consider, first, the use of the term 
hue. This term is to be understood as 
either a label for or as an inferred ef- 
fect (behavioral, physiological, or mathe- 
matico-representational) in the follow- 
ing stimulus-response sequence: (a) in- 
structions to a subject who has had a 
past history with the vocabulary repre- 
sented in the instructions, (b) the pres- 
entation of radiant energy to the sub- 
ject, and (c) the subject’s responses. It 
turns out, as a matter of empirical fact, 
that wave length is the most important 
variable in (b) for hue discrimination. 
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The dependence of hue discrimination 
on wave length may have been estab- 
lished early in such observations as the 
ordering of stimuli by a person when he 
is instructed to arrange colors on the 
basis of hue. The subject is said to 
discriminate differences in hue when he 
gives one response (for example, a hue 
name) to a radiant flux of one narrow 
wave-length band and another response 
to flux of a different wave length. An- 
other type of correlated observation in- 
volves the subject’s giving (@) one re- 
sponse (“No, there is no difference in 
hue”) to a small difference between 
wave lengths of a pair of stimuli, and 
(b) another response (“Yes, there is a 
difference in hue”) to a larger differ- 
ence. The threshold wave-length dif- 
ference is evaluated at the appropriate 
frequency of occurrence of the two re- 
sponses. 

It will be observed that the word Aue 
comes into play at least twice in the se- 
quence instructions, stimulus, response: 
It occurs as part of the instructions (as 
in the statement “arrange these colors 
on the basis of hue”); and it is the 
term for the inferred effect relating 
stimulus and response, or the label for 
the relation. 

The word kue used in the instructions 
(and reacted to by the subject) is ana- 
lytically a different word from the word 
applied to the discrimination. The dif- 


ference in the two words is comparable _ 


to the difference discussed by Bergmann 
and Spence (1) when they contrast the 
social” language of the subject and the 
experimenter with the metalanguage © 
the experimenter as a scientist. Pos- 
Sibly, the difference could be specified 
by the use of subscripts, as in Aue: an 
huea, where the former word represents 
the term in the instructions, and the lat- 
ter the term applied to the discrimina- 
tion. The former term controls a sub- 
jects activity; it “tells him what to 
do”; it may imply little or no theo- 


retical context. The latter term in- 
volves whatever meanings may be at- 
tached to it by its ramifying empirical 
and theoretical connections. Thus, for 
example, it may imply an elaborate con- 
text of physiological mechanisms. 

Considerations comparable to those 
holding for the word Aue exist with re- 
spect to the word saturation. The term 
represents the inferred effect or rela- 
tional label holding between such a 
stimulus variable as colorimetric purity 
and differential responses that exist when 
a subject is instructed to order colors 
on the basis of, for example, “paleness 
of color.” 

Differences in brightness may be dis- 
criminated by processes analogous to 
those holding for hue discrimination. 
The subject, under instructions to ar- 
range stimuli according to differences in 
brightness or to indicate brightness dif- 
ferences, gives different responses (for 
example, the words dim and bright) to 
different luminances, or he may dif- 
ferentially signal a difference between 
luminances. Such discriminations are 
said to be brightness discriminations. 
The word brightness appears in the 
double sense characteristic of the word 
hue. 

The word color represents a generic 
type of discrimination based on a com- 
bination of wave length, colorimetric 
purity, and luminance. The subject, 
properly instructed, can respond by 
naming different colors or with such 
differential responses as “Same color” 
or “Different.” 

The term chromaticness is of consid- 
erable interest, based as it is on a very 
definite type of representational theory: 
the chromaticity diagram. Chromatic- 
ness theoretically represents a generic 
type of discrimination based on a com- 
bination of wave length and a quantity 
called excitation purity. It thus may 
be taken to be a combination of hue 
and saturation, with brightness removed 
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from consideration. As a concept it is 
based on a highly abstract mathemati- 
cal representation, and its correlated re- 
sponses have not as yet been carefully 
considered. 

The word perception itself requires 
some discussion. The word does not in 
itself refer to an inferred effect having 
the significance of an intervening vari- 
able (as has been the case with the 
words previously discussed). It is a 
class name for all such variables. Pre- 
sumably, it is to be understood as the 
class of all relations conventionally sub- 
sumed under the word perception. (We 
shall not here go into the problem of 
formal bases for this subsumption.) In 
any case, we should not expect to hy- 
pothesize about a generic process of 
perception as we might about a specific 
component process, 

This discussion of representative terms 
in the field of color, some being taken 
directly from an older psychology, will 
probably show that the latter terms can 
be used in the vocabulary of a behavior 
science. Whether they will serve there 
more suitably than new terms is an open 
question. Would it be better, for ex- 
ample, to speak of wave-length discrimi- 
nation rather than hue? On this prob- 
lem at least one thing may be said: 
Comparable aspects of wave-length dis- 
crimination show different dependencies 
on wave length as functions of other 
variables. Put in another way, hue 
varies with other factors than wave 
length. The difficulty posed by this 
fact can be surmounted, but possibly 
only at the cost of some terminological 
cumbersomeness. 


Some RECURRING CONSIDERATIONS 


Let me now take up some recurring 
considerations in the study of percep- 
tion. In particular, I feel that the old 
empiricistic-nativistic controversy may 
still have some implications for present- 


day studies. I am not pleading for a 
renewal of this ancient argument, but 
do desire a clarification of some of its 
concepts in the interest of benefiting our 
present understanding. 

Any situation that involves a dis- 
crimination, an estimate, a denotation, 
or a naming response employs a subject 
with a system of responses. Many, or 
most, of the responses are the end- 
products of the subject’s history of act- 
ing in a complex and changing environ- 
ment. Now it is never shown in re- 
searches on the intact organism that 
the subject’s past history is irrelevant. 
What may be shown is the fact that, in 
certain situations, past history variables 
do not establish the form of a given 
function. In a word, such variables 
exist at constant values. Under other 
circumstances, they do influence the 
form of a function; they are contribut- 
ing variables. The identification of 
both classes of function is a matter for 
experiment. 

Past history variables are important 
in some types of perceptual behavior, 
and they are now receiving increased 
attention. Studies of motivational fac- 
tors in perception are being conducted 
at an accelerated pace. Despite some 
criticisms that can be made of certain 
recent experiments, nevertheless, in gen- 
eral they support a permissible modern 
interpretation of Helmholtz’s Position 
(8) on unconscious inference: discrimi- 
native behavior is influenced by past 
conditions of the subject. The study of 
the latter effects is in one of the two 
major directions of perceptual investi- 
gation. In pursuing this line, answers 
may be sought to such questions as the 
following: What do genetic studies tell 
us about the development of certain 
discriminations? What sorts of train- 
ing procedure can be shown to influence 
discriminations and how do they do it? 
How will different instructions change 
the responses of the subject? What are 
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appropriate dimensions for the quantifi- 
cation of instructions? 

The other major rank of perceptual 
study is, of course, an old one. It is 
concerned with the problem of how sys- 
tematic variations in stimulus condi- 
tions influence discriminations. This 
area has had its problems and contro- 
versies, but we need not revive them 
now. It is sufficient to say that the 
necessity for advancing our knowledge 
of the energy bases of discrimination to 
more refined levels remains with us. It 
is true that some quantitative experi- 
ments have been done in the past, but 
it is also true that experimental pa- 
rameters have not always been varied 
Over a large range and in a systematic 
manner. Put briefly, many perceptual 
experiments of the past have often been 
demonstrational rather than analytic, 
Particularly in cases where variables 
other than intensity and wave length 
have been the main centers of interest. 
One thing the study of perception needs 
is adequate data to provide a firm base 
for theory. In the future it may prob- 
ably turn out that many interesting 
demonstrations may be seen for what 
they are—special cases of much more 
general quantitative principles. 


PERCEPTION AND PSYCHOLOGICAL 
‘THEORY 


It seems interesting and important 
that we can go a considerable way in 
Classifying some of the discriminative 
functions in perceptual behavior an 
that we may designate two directions 
which appropriate studies may take: 
investigating (a) conditions of the sub- 
Ject, and (b) stimulus conditions. 

It is probably true that the studies of 
Conditions of the subject (in the first 
ta ection) must be brought into relation 
ie earning and conditioning. To put 
in riefly, such studies must make mean- 
Mgful contributions to our understand- 
ng of behavior modification, the area 


where past history variables are of pri- 
mary concern. 

Studies of the stimulus control of be- 
havior (in the second direction), when 
properly made, usually do not show the 
influence of past history as a contribut- 
ing variable. The experimentalist and 
theorist in this area has the responsi- 
bility of relating his findings to an al- 
ready existing precise body of formal- 
ized information. In a sense, then, the 
experimentalist in this area must do 
more than classify the behavior func- 
tions. He should be sensitive to the 
need for relating his findings to appro- 
priate theory, sometimes in the vocabu- 
lary of psychology, sometimes in the vo- 
cabulary of physiology, and often in the 
terminology of mathematics. By his 
contributions the psychologist interested 
in the stimulus control of behavior can 
provide sounder bases for the interpre- 
tation of the often admirably conceived 
but disconnected hypotheses that now 
exist, and, in a larger context, advance 
the synthesis of these hypotheses into a 
more inclusive theory. 

Finally, it is clear that an ultimate 
synthesis of information obtained in fu- 
ture investigations—i.e., from studies of 
(a) conditions of the subject and (b) 
the stimulus control of behavior—will 
be required for purposes of a more em- 
bracing theory. Data from the first di- 
rection will tell us how discriminative 
behavior develops and what are the con- 
ditions for its development; and data 
from the second direction will tell us 
how such discriminations are controlled 
and limited. Such a synthesis will make 
the study of perception coextensive with 
the general study of behavior. 


SUMMARY 


This paper discusses five topics that 
are of systematic importance in the 
treatment of sensation and perception 
from the point of view of an objective 


psychology. 
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(1) Types of stimulus-response rela- 
tions are considered as they relate to 
the following topics: (a) introspective 
or phenomenological descriptions, (b) 
psychophysical discriminations, (c) ab- 
solute judgments, and (d) thresholds 
and perceptual functions. Only (b) 
and its derivative (d) usually involve 
measured aspects of both stimulus and 
response. In the case of (c), the re- 
sponse is usually not quantitative. In 
the case of experiments involving intro- 
spection or phenomenological descrip- 
tion, neither stimulus nor response is 
quantitative. 

(2) The terms sensation and percep- 
tion probably do not refer to two dif- 
ferent operationally specifiable concepts. 

(3) Problems of terminology are con- 
sidered. Such terms as hue, brightness, 
and saturation are examined and ana- 
lyzed. It would probably be possible 
in a behavior science to dispense with 
such terms, but they may have the ad- 
vantage of reducing cumbersomeness. 
Their meanings within a context of 
stimulus-response relations are consid- 
ered. 

(4) Some recurring problems of per- 
ception are described, with particular 
reference to the kinds of variables in- 
volved. It is stated that the two ma- 
jor directions of perceptual investigation 
involve studies of the effects on dis- 
criminations of (a) stimulus conditions 
and (b) past history parameters and 
conditions of the subject. 

(5) Analysis of the classes of stimu- 
lus-response functions concerned in per- 
ception indicates that the study of the 
latter topic becomes coextensive with 
the general study of behavior. 
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TOWARD A QUANTITATIVE DESCRIPTION OF 
LEARNING SET DATA 


FRANK RESTLE 1 


Michigan State University ? 


When the same monkey, ape, or 
child is given hundreds of object-dis- 
crimination problems to solve, he gets 
better and better at mastering them. 
Controls in the basic experiments 
eliminate two attractive explanations 
of this improvement. The cumula- 
tive transfer of training is not based 
on cues common to and peculiar to 
the correct objects, for problems are 
made up by random pairings from a 
pool of mutually discriminable ob- 
jects. The cumulative improvement 
is not likely to be based solely on 
habits of conforming to the procedure 
of the experiment. Monkeys typi- 
cally receive months of preparatory 
training in moving objects, in picking 
up rewards, and in general “good be- 
havior” before the learning-set (LS) 
experiment itself begins. 

It would appear, to the naive ob- 
server, that monkeys in an LS experi- 
ment “get the idea” of solving dis- 
crimination problems. An adult 
human subject, trained in this fash- 
ion, would soon realize that of the two 
objects presented in any problem, one 
is rewarded and the other not. He 
would learn that when new objects 
appear, the correct response is unpre- 
dictable until after one information 
trial, but that, in subsequent trials 
With the same objects, reward is con- 
Sistently given and predictable. The 


1 The author is indebted to H. F. Harlow, 
Vi M. Warren, and A. J. Riopelle for many 
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data suggest that monkeys also re- 
spond in this fashion, though they 
acquire this general understanding of 
the situation only very slowly. 

If we had available only the obser- 
vation that monkeys improve in suc- 
cessive discrimination problems, it 
would be tempting to try an explana- 
tion in terms of the simple processes 
characteristic of naive rats. The 
compounding of further detailed in- 
formation about the performance of 
sophisticated monkeys and apes makes 
such a reduction seem almost impos- 
sibly difficult. But an attempt to 
formulate the intuitively sensible hy- 
pothesis that monkeys gain an ab- 
stract understanding of the situation 
puts a great strain on current theoret- 
ical resources. Cognitive discussions 
of learning and performanceare validly 
criticized on the ground that they do 
not yield precise predictions regarding 
behavior and thus are not conveniently 
verifiable. In the present paper, an 
attempt is made to formulate a de- 
scriptive theory of LS formation and 
operation which centers on an assump- 
tion of abstract cognition, and yet 
yields exact quantitative predictions 
which can be, and in some examples 
below will be, compared with the de- 
tails of monkey performance. 

The theoretical aim is to summarize 
a variety of data, using a few assump- 
tions. To accomplish this kind of 
parsimony the writer has found it 
expedient to violate another informal 
canon of learning theory: that proc- 
esses invoked should be simple and 
familiar. It is assumed that monkeys 
use an abstract understanding of the 
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pattern of an LS experiment, trans- 
cending the “‘stimulus-response” ru- 
bric familiar in most theories of learn- 
ing. 

Since the present formulation grew 
out of attempts to extend an earlier 
theory of discrimination and choice 
learning (10,11,12) to LS experi- 
ments, it is not surprising that the LS 
theory contains the earlier theory asa 
special case. However, it would be 
wrong to consider this LS theory a 
deduction from the simple discrimin- 
ation learning theory. The logical 
implication is from the LS to the 
simpler theory, not vice-versa. 


EMPIRICAL SCOPE OF THE THEORY 


The present theory is intended to 
describe the change in height, rate, 
and shape of within-problems learning 
curves with successive problems. 
While the general characteristics of 
the data are represented in the theory, 
complications arising from our igno- 
rance about how the stimulus objects 
appear to monkeys, and indeed about 
the actual stimulus properties of the 
objects, make it difficult to determine 
whether or not the theory is correct. 
In addition, as we shall see, there is 
evidence that the theory is incomplete, 
leaving out one of the factors which 
result in improved performance. 

The formulation may also be re- 
lated to reversal learning-set, and the 
solution of problems like oddities, 
matching from sample, delayed reac- 
tion, etc. These other problems can 
be thought of as involving (for the 
monkey) abstractions like, but differ- 
ent from, those involved in ordinary 
discrimination LS. The theory has 

not been elaborated enough to permit 
detailed analysis of these problems, 
d the present paper will merely 
oa sibilities. 
sketch the pos f a 
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analyzing the performance of sophis- 
ticated monkeys on various perceptual 
problems. After extended training 
the monkey reaches a relatively 
stable state, in which his performance 
on a variety of tasks can be studied 
intensively. It happens that the 
equations describing performance in 
this stable state are quite simple. 
Since the theory purports to rélate 
learning to perceptual characteristics 
of the situation, it is possible to frame 
some interesting, if oversimplified, 
perceptual hypotheses, and then use 
the theory to evaluate them. Though 
these tests of the theory are separate 
from a large part of the general ap- 
paratus of the theory, and involve 
extraneous hypotheses about the 
stimuli, they are helpful in evaluating 
the theory because the interpretation 
of the data hinges on the crucial as- 
sumption of the whole structure—the 
relation between the perceptual situ- 
ation and the rate of learning. 


THEORY OF LEARNING SETS 


The fundamental assumptions of 
the theory are: (a) Solving a discrim- 
ination problem involves two proc- 
esses: conditioning valid cues to the 
correct response, and adapting or sup- 
pressing invalid cues. (b) These proc- 
esses can be quantitatively described 
as the cumulative effects of a random 
process. (c) The rates of these proc- 
esses (that is, the proportion com- 
pleted on each trial) depend on the 
mean validity of cues present. (d) 
The probability of a correct response 
is the proportion of functional (un- 
adapted) cues conditioned to the cor- 
rect response. These assumptions 
have been stated in earlier papers 
(10, 11, 12). 

In learning-set experiments involv- 
ing higher primates, there exist cer- 
tain special cues, which we shall term 
“‘type-a” cues, which are valid (con- 
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sistently reinforced) and common to 
all problems of the experiment, re- 
maining constant when the particular 
stimulus objects are changed. These 
cues are present on every trial except 
the first (information) trial of each 
problem, and they depend on previous 
reinforcements. In effect, a monkey 
responding to a type-a cue is respond- 
ing to “the object which has been 
reinforced.” . He is responding to the 
property of having been reinforced, 
as distinguished from other properties 
such as size, color, spatial shape, ar- 
rangement of parts, etc. These lat- 
ter concrete properties give rise to 
type-b cues, which are valid (consist- 
ently reinforced) during a given prob- 
lem, but which are not valid in the 
long run. There are other cues in the 
Situation, arising from the apparatus, 
certain characteristics of the pro- 
cedure, and, most important, from 
within the animal itself. These cues, 
termed type-c, are not reinforced con- 
sistently at any time, and thus are 
Invalid. 

Indications are that in the monkey, 
type-a cues make up a very small 
Proportion of the cues the animal can 
use. Type-b make up a much larger 
Proportion, about one-third of the 
Cues, and type-c cues make up the 
rest, nearly two-thirds, in ordinary 
object-discrimination problems of the 
Wisconsin type (4). During any one 
Problem, type-a and type-b cues be- 
Come conditioned to the correct re- 
Sponse and type-c cues become 
adapted, so that eventually the ani- 
mal can attain perfect performance on 
the problem. But slowly, as more 
and more problems accumulate, type-¢ 
Cues become adapted permanently, 
So that they have no effect on per- 
ormance. Learning within a prob- 


* Verbal statements of the principle, by 
Young children, are described by Hayes, 
©mpson, and Hayes (6). 


lem is then faster, since it need not 
wait upon the adaptation of the 
type-c cues. Then, even more gradu- 
ally, type-b cues become adapted,‘ 
since they are not as valid as the 
type-a cues. When this process is 
nearly completed, the animal is re- 
sponding in terms of the abstract 
type-a cues only. Since, by theory, 
these cues do not change from prob- 
lem to problem, the animal does not 
learn new problems, but merely ap- 
plies his previous knowledge again 
and again. As soon as one of the 
objects gains the property of “hav- 
ing been reinforced,” through the one 
information trial, the animal responds 
perfectly and continues his accurate 
response for the rest of that problem. 

Thus, this theory describes the fact 
that monkeys and higher primates 
(including children) become capable 
of one-trial solution of object-dis- 
crimination problems (4, 6). The ac- 
complishment of this skill is described 
as a gradual process, again in agree- 
ment with the facts (4,6). But more 
precise evaluation of the ideas de- 
pends on a more specific statement of 
the theory, in quantitative terms. 
We now turn to this problem. 

The quantitative assumptions of 
this kind of discrimination learning 
theory are stated elsewhere (10, 11, 
12). The present theory is identical 
with the most recent statement of 
these assumptions (11), except for a 
refinement of the concept of adapta- 
tion. Only a very brief resumé, as 
needed for this paper, is given here. 

Any cue, k, is conditioned to one 

4 Riopelle has stated, and supported by ex- 
periment, the idea that highly sophisticated 
monkeys are not bound to concrete character- 
istics of the objects. In particular, he showed 
that such experienced monkeys rapidly forget 
previous experiences with particular objects 
(13). Limitations on the asymptotic per- 
formance of cats have been attributed by 


Warren to inability to go beyond the particu- 
lar stimuli (personal communication, 1957). 
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or the other of the alternative re- 
sponses. The probability that & will 
become conditioned to the reinforced 
response is 6, the mean validity of 
cues present. If K is the total set of 
cues present, V, is the validity and 
0; the weight of cue k, then 


0 = DVi8.(k in K) C] 


If Fun) is the probability that cue 
k is conditioned to the correct re- 
sponse on trial n, and the correct 
response is rewarded, then 


Fony = Fan (1 — 0) +0 [2g 


whereas if the other (incorrect) re- 
sponse is reinforced, 


Fanti) = Fem (1 — 6) [3] 


is the correct response is reinforced 
for a block of j trials, n + 1, n + 2, 
-n +j, 


Firn = 1 
— [i — Fam] -0 [4] 


This function, and its relation to a 
cumulative random process, are ex- 
plained by Estes and Straughan (2). 

In applications, we shall assume 
that the type-a cues are consistently 
reinforced on every trial except the 
first of each problem. If, at a cer- 
tain trial of the experiment, there 
have been W such trials, 


Fn) =1-[1—Fey](1—0)¥ 
(a = a type-a cue) [5] 


Since these experiments involve thou- 
sands of trials, and @ is sizeable (at 
least over .10 in most applications), 
Fan) soon approaches 1, 

We shall assume that type-b cues 
are reshuffled at each new problem, so 
that at the first trial of each problem 
some unknown proportion of them 
are conditioned to the correct response 
and the rest to the wrong response. 
Learning then progresses regularly 


during the problem. It is convenient 
to describe a trial as the nth trial of 
the current problem. Then 


Fom=1-[1— Fe, y]d—0) 
(b = a type-b cue) [6] 


Because the conditioning depends 
both on the total noninformation 
trials (N) and the number of trials on 
the current problem (n), we shall 
write Fu.y,n) as the probability that 
cue k is conditioned to the correct 
response at trial N, n, the Nth non- 
informational trial of the experiment, 
the th in the current problem. 

Since type-c cues are reinforced at 
random throughout the experiment, 
they cannot become conditioned. con- 
sistently to the correct response, It 
is convenient for present purposes 
merely to state that 


Few,ny=.5 (c=a type-c cue) [7] 


Adaptation of cues presents a some- 
what more complicated problem, es- 
pecially since some recent results of 
Hammer (3) have shed new light on 
the nature of learned perceptual se- 
lectivity. Previously, it has been 
stated that invalid cues become 
“adapted,” Suppressed, or nonfunc- 
tional (10, 11, 12). But Hammer 
showed that if certain invalid cues are 
adapted in the presence of some valid 
ones, and then the valid cues are 
changed, the invalid cues recover 
their function and are “de-adapted,” 
Or, at least, there is no Positive trans- 
fer of training. This suggests that 
perceptual selectivity is a more deli- 
cate process than mere adaptation of 
invalid cues. It appears that invalid 
cues are adapted with, respect to valid 
cues present, and their status as non- 
functional depends on the presence of 
the valid cues. To say that invalid 
cue 7 is adapted with respect to valid 
cue v is to say that, when both are 
present, cue 7 has no effect on per- 
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formance. If cue ¿ is adapted with 
respect to several valid cues, then it 
is nonfunctional whenever it appears 
along with any of the valid cues. 
The probability that cue ¿ will be 
adapted with respect to cue v depends 
on the relative weight of cue v in the 
whole set, and also on the difference 
between 7 and v in validity. Specifi- 
cally, we may write this probability 
aS um). Letting 0, be the relative 
weight of cue v, and V, and V; be the 
validities of cues v and 7, then 


bu) = bl Ve — Vi) [8] 


Letting A (ij,x,) be the probability 
that cue i is adapted with respect to 
Cue v at trial N, n, we have 


Aaiya d A til N,n) 
X [1 — pum] + am [9] 


This formulation is a refinement of a 
Previous statement regarding adapta- 
tion of cues under conditions of prob- 
able reinforcement (11). 

The significance of the idea that cues 
are adapted with respect to specific 
other cues appears when we consider 
the adaptation of type-c cues in 
€arning-set experiments. Within 
cach problem, if it is solved, all type-¢ 
Cues are adapted. But this seems to 
ati little immediate effect on the 

€xt problem. The reason is that 
on cues are in the main adapted 
ch respect to type-b cues, which are 
anged for the next problem. When 
se change is made, the type-¢ cues 
Boa red at once from adaptation, 
have to be adapted all over again. 
al €n type-c cues happen to become 
.-*Pted to type-a cues the adaptation 
oo rmanent, because the type-@ cues 
PEO change. Thus, within cachi 
Tapign the type-c cues are amd 
A g adapted, but over many pr°?- 
enthy UY gradually become perman- 
Y adapted. 


„Reducing these observations to 
difference equations, we have ~ 


Awy,n) = 0 [10] 


Type-a cues are not adapted because 
there are no other, more valid cues 
present with respect to which they 
can be adapted. Type-b cues have a 
certain validity, V», which is less than 
1 because such concrete cues are not 
consistently reinforced over the en- 
tire experiment. Type-b cues can be 
adapted only with respect to type-a 
cues, so we may write ¢@/e) = 0a(Va 
— V). Assuming that V, = 1, since 
type-a cues are consistently reinforced, 


we have 


A bja, N41, n41) = A wlan, nlt — 0a 
X (1 W]+ a- v) [11] 


Solving Equation 11 involves a con- 
ceptual problem. If a certain type-b 
cue is present only in some problems 
(as must be expected from the struc- 
ture of a LS experiment), then the 
over-all probability that a certain cue 
ka is adapted on trial n becomes 7¢ (b/a), 
where r is the probability that k, will 
be present on a given trial. For 
present purposes it is not feasible to 
estimate 7, and we shall assume that 
r=1. We are assuming that the 
several problems are made up by re- 
shuffling a single set of type-b cues. 
This is certainly not exactly true. 
However, Riopelle (15) trained mon- 
keys on LS with a minimal number of 
stimulus objects (that is, with 7 near 
1), and then transferred the animals 
to ordinary LS (with y presumably 
lower). The absence of any con- 
sistent decrement in transfer indi- 
cates that LS is quite general, and 
that r is approximately 1. In “‘con- 
cept-formation” experiments with hu- 
mans (7, pp. 232-241), perfect per- 
formance is often maintained despite 
continuous changes in the particular 
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stimuli presented. This again indi- 
cates that variations in stimulus ob- 
jects may be more like reshuffling than 
one would think. 

If essentially the same type-b cues 
appear in most problems, we should 
apply Equation 11 on every trial 
where type-a cues appear. The solu- 
tion is 


A @ja,n,n) = 1 
-0-a — WP [12] 
Adaptation of type-c cues is some- 
what more complex, since they can be 
adapted with respect to either type-a 
or type-b cues. The probability that 
any type-c cue will be adapted to a 
type-a cue on trial n, given that it is 
not already adapted to a.type-a cue, 
is ĝa, since Va = 1 and V.=0. The 
probability that the type-c cue will be 
adapted with respect to a type-b cue, 
given that it is not already adapted 
to a type-b cue, is Væ. This state- 
ment may be related to a ‘‘cue-sam- 
pling” notion by imagining that a 
given cue can be adapted with respect 
to both type-a and type-b cues; how- 
ever, on a given trial, only one type of 
adaptation can occur. A cue cannot 
go from the state of being adapted 
with respect to neither a nor b cues 
into the state of being adapted with 
respect to both, in the space of a 
single trial. 
With this formulation we get the 
difference equations 


A cela N+ nt) = A e/a,N,n) 
X (1 = 0a) + 6. 
[13] 


and 


Acnwenty) = Acs,n,n) 
X (1 — Vab) + Vids 


= L Fan. [1 — Aenm] 
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Then the probability that c is adapted 
with respect to at least one of the two 


types of cues, A (cja42,n,n), follows the 
difference equation 


A telatd NEL nti = A (c/a4b,N,n) 
X (1 — ba — Vibe) + (02 + Vebo) 
so that 


A ciasb,n4i,i) = 1 
— O = Aenm] 

x (1 1050 — Viba) i1 [14] 
Earlier discussion of adaptation sug- 
gests that, when a new problem 
is introduced, type-c cues adapted to 
type-b cues only become released 
from adaptation, whereas cues adapted 
to type-a cues remain adapted. Then 


A c,n41,1) = 0 
so that 


A (latv41,1) = A (e/a,n41,1) 


which from Equation 13 can be shown 
to be 


A ¢e/a,N+1,1) = 1 
i le A (e/a,1,1) (1 — Oa)" 


No cues are adapted at the beginning 
of the whole LS experiment. Thus, 


A /a,1,1) = 0 
A¢jan+in) =1— (1 — 84)" 


Substituting thisexpression into Equa- 
tion 14, we have, finally, 


A ote,N+4i,3) = 1 — (1 — 6.) 
X (1 — ba — Visi [15] 


According to the discrimination 
theory (11), the probability of a cor- 
rect response is the proportion of un- 
adapted cues conditioned to the 
correct response. This probability, 
which we here call py, a), is given by 


ÊN, n) = 


D= Awra] 


(all cues %) [16] 


= 
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which may be expanded using equations 5, 6, 7, 10, 11, and 15, as 


Puy) = 


a[l — (1 — Fern) — 0] +401 — A- Ferna) — 6)" 7] 


X [(1 — a(1 — Ve) + 5 0e(1 — Oa) (1 — Vo)" 


Oa + OL — Oa(1 — Vo) P + 8.(1 — 0a) (1 — OV)" 


While Equation 17 is large and 
complicated in form, it contains only 
three main independent parameters, 
each of which may be manipulated 
separately. The parameters are ĝa, 
the proportion of type-a cues; , the 
proportion of type-b cues; and Vz, the 
validity of the type-b cues. The 
other parameters, 6. and 0, are simply 
defined in terms of the three independ- 
ent ones, for 6. = 1 — ba — %, and 
0 = 0a H 0Va. If data on many 
problems are to be averaged, we may 
let Fia) and Fo,v,1) be one-half. 

Equation 17 purports to give the 
Probability of a correct response on 
any trial of a learning-set experiment. 
Tests of the theory will use Equation 
of and certain simplified special cases 

it. 


APPLICATIONS OF THE 
THEORY TO DATA 


_ The first question is whether Equa- 
tion 17 properly describes the set of 
learning curves arising during the 
formation of learning sets. The com- 
Plicated form of the equation pre- 
cludes the use of standard curve-fitting 
Methods to test the theory. The 
More refined stochastic-model meth- 
ods are equally inaccessible, for the 
Same reason. The only method of 
evaluating the theory is to evaluate 

e parameters 0a, 6, and Vs by @ 

Method of approximations, and set 

quation 17, with those approximate 

Parameters, alongside empirical per- 
Ormance curves. Harlow (4) has 
Published learning-set functions for a 
Pee of monkeys learning object CS” 

minations. To a fair approxima- 


[17] 


tion, these data are fitted by Equation 
17 with 


0, = .0075 
0 = .350 
Vi = -10 


While these parameters are clearly 
only approximate, the correspondence 
between Equation 17 and the data is 
quite good, as shown in Fig. 1. 

Inspection of Fig. 1 reveals that 
the changes in height, slope, and shape 
of the within-problems curves are fol- 
lowed by the theoretical function. 

However, a second computation was 
performed on pattern-discrimination 
data published by Harlow and War- 
ren (5). Here, it was impossible to 
get a good fit, because the rate 
of within-problems learning, based 
mainly on 6, seems to increase during 
learning for the monkeys. The the- 
ory tacitly assumes that 0, the pro- 
portion of type-b cues, is constant 
throughout the experiment. It ap- 
pears that the monkey’s attention is 
initially directed to the edges of the 
objects presented, and thus away from 
the patterns drawn on those objects, 
since the patterns are centered on the 
objects. The monkeys probably ac- 
quire some sort of “observing re- 
sponse” (23), directing their visual 
fixations toward the centers of the 
objects. If this should happen, it 
would increase the relative weight of 
the pattern cues (type-b cues) so that 
@ would increase during the experi- 
ment. 

The success of Equation 17 for ob- 
ject-discrimination, where no special 
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Fic. 1. Comparison of theoretical learning set function, Eq. 17, with 9 = .0075, % = .350, 


Vy = .70, and some data from Harlow (4). 
panels to prevent crowding. 


visual habits are required, and its 
failure for pattern discrimination, 
where the monkey must overcome a 
pre-experimental predilection, indi- 
cate that the present theory might be 
used to isolate the formation and 
effects of observing responses.® 
In ordinary learning set, the type-a 
cues are conceived to be abstract 
conditional discrimination cues— 
“abstract” because the animal can 
learn to respond independently of the 
concrete, type-b cues present and 
“conditional” because the object 
chosen is contingent on previous rein- 
forcements or nonreinforcements of 
those objects. In discrimination 
learning-set experiments, response to 
such abstract, conditional cues is con- 
sistently reinforced. In discrimina- 
tion-reversal experiments, the object 
recently reinforced becomes the wrong 


5A theory of learning set, which depends 
mainly on observing responses as the mech- 
anism, is possible. In the present theory, 
observing responses are considered possible 
special influences on behavior, but do not play 
a central explanatory role. 


Alternate sequences of problems are shown in 


two 


object, abruptly, in the middle of the 
experiment. The monkey must learn 
a new contingency, for he must learn 
to reverse objects to the cue of an 
unexpected nonreinforcement. That 
reversals can be cued, by controlled 
stimuli, has been demonstrated in an 
experiment by Riopelle and Copelan 
(16). The mechanism to be suggested 
here for reversal learning set is es- 
sentially the one they were analyzing 
—another type of cue, which might be 
called type-a,, which in reversal learn- 
ing can come to supercede regular 
type-a cues. Since, right after rever- 
sal, the animal would have no clear 
cue based only on type-a cues (which 
object, after all, is the one which “has 
been reinforced” ?—both objects have 
been reinforced), it is reasonable to 
believe that some more fitting and pre- 
cise abstract cue may arise to control 
behavior. 

The present theory relates rather 
directly to Meyer’s equations for dis- 
crimination and reversal learning (8, 
9). Meyer noted that within-prob- 
lems learning curves from fairly 
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sophisticated animals are exponential 
in form, with a constant rate but a 
gradually increasing origin. If, in 
Equation 17 above, we assume that 


the type-c cues are almost all adapted 
(i.e., A cejathN,n) is almost 1) and the 
type-a cues are all conditioned to the 
correct response (F(an,n) = 1), then 


0. + &l1 — (1 — Fox») -Om — Bu = Ve) [18] 


ena = Oa + OL1 


Since the adaptation of type-b cues is 
very slow relative to the conditioning 
of them, a given within-problems 
learning curve would be very closely 
approximated by considering [1 — 0 
(1 — V,)}*- a constant. Elemen- 
tary algebra shows that in this case 


Pom =1— p — o) [19] 


where 8 depends on the degree of 
adaptation of type-b cues and also on 
the animal’s original preference for 
one object or the other. If almost all 
type-b cues are adapted, 6 becomes 
quite small. Since Equations 17, 18, 
and 19 all include type-a@ cues as pres- 
ent, of course, we must assume that 
we are talking, not about an informa- 
tion trial, but some later trial (Trial 
2, 3, etc.) in the problem. Equation 
19 shows that the within-problems 
learning curve, for a fairly sophisti- 
cated animal, will be nearly exponen- 
tial in shape. It is also worth noting 
that the rate of this exponential curve 
depends on 0, which is a constant €x- 
cept for possible observing responses. 
Meyer’s observation was that within- 
Problems learning curves were roughly 
exponential, with a constant rate. 
His rate, estimated from his own data, 
was .251, which corresponds closely 
ke the rate we selected for Harlow’s 
a In Harlow’s data 0 = ĝa 
bV, was .0070 + .350(.70) = -2520- 
e gradual increase in the intercept, 

Or performance on the first postinfor- 
Mation trial, according to the present 
F €ory depends on adaptation © 
Ype-b cues, and a corresponding 4€- 
Crease in £. 


— 6a(1 _ Vs) P= 


In an oddities problem, the monkey 
must select whichever object, of three 
or more, is the odd one. His per- 
formance will be no better than chance 
if he selects the object which last was 
reinforced, since that may not be odd. 
Monkeys can solve such problems, 
presumably on the basis of an appro- 
priate abstract cue, but the formation 
of a discrimination learning set seems 
to be of little help, and perhaps a 
hindrance. This observation checks 
with the theoretical interpretation. 

It should be clear that further 
analysis of data from experiments on 
reversal learning, oddities, and so 
forth, isneeded. The general method 
of formulation suggested by this the- 
ory is to hypothesize a kind of type-a 
cue corresponding to the procedure 
employed, taking account of possible 
cues the monkey could use to solve 
the problem. These conditional, ab- 
stract cues can be evaluated as to 
their relative weight (a), using Equa- 
tion 17 or reasonable modifications 
thereof. Interrelations between the 
various abstract cues are sometimes 
apparent from inspection. A number 
of tests of the theory are possible 
along these lines, and may be worth 
pursuit. 

Equation 19 is the basis of other 
tests of the theory. Using this equa- 
tion, we can analyze the performance 
of monkeys which are fairly sophisti- 
cated. The important idea is that in 
Equation 19, the parameter 6 depends 
mainly on the proportion of type-b 
cues. This is so because 0 = ba + 
0s Vo, and ba in the monkey seems to be 
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very small compared with ð. To an 
approximation, then, 0 œ @V, It 
should be recalled that type-b cues 
are concrete, differential cues. Dif- 
ferences between the discriminanda in 
color, size, form, or other properties 
are represented as sets of cues of type 
b. The relative amount of valid dif- 
ferential cues theoretically controls @ 
in Equation 19. 

An example of the use of Equation 
19 is in an experiment on additivity 
of cues, performed by Warren (21). 
The same monkeys solved a variety of 
pattern discriminations in which the 
differential cues were form (F), color 
(C), size (S), or combinations of two 
characteristics (F + C, F + Ss,C+ 
S), or combinations of all three dimen- 
sions (objects differing in’F + C +S). 
At the beginning of testing the animals 
were fairly sophisticated. 

Assuming that 8 would be the same 
for all the various problems, it was 
possible to estimate the parameters 
for the F, C, and S problems. The 
parameters selected were 


B= 35 
br Vr = .089 
bc Vo = .221 
ês Vs = .067 


These parameters were selected by 
visual curve-fitting, 8 depending on 
the intercept and 0V depending on the 
learning rate. 

Assuming that in these problems, as 
in object discrimination, the validity 
of type-b cues is about .70, so that 
Ve = Vo =.Vs = ..70, the above 
equations can be solved for øp, bc, and 
s, the proportions of type-b cues in 
the three types of problems. The 
values obtained are R 


Op = 127 
bo = -316 
Os = .096 
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From these Parameters, we seek to 
predict performance on multiple-cue 
problems, that is, the F +C,F+S, 
C=; S and FEC +S problems. 
We first note that if we let I be the 
number (or weight) of type-c cues, 
F be the weight of form cues, C be the 
weight of color cues, and S be the 
weight of size cues, we have the rela- 
tionships 

ðr = F/F4+7 
or 

F= I6p/(1 — 0s) = 1467 

C= Iðc/(1 —" 8c) = .4627 

S = Iðs/(1 — 8s) = .1067 
We now note that if form, color, and 
size cues are additive, 


9r40= F+C/F+C+I=.1467 
+.4621/.146I+.46274+7= 379 


Similarly, 


Orzs = 146I + .1067/ 

‘1462 + 1067 + I = .201 
bors = .362 
Or+c+s = .416 


By assumption, V in all these cases is 
.70, so we may at once compute the 
predicted learning rates for these 
combined-cues problems. They are 


OricVryc = .265 
Or4sVrys = .141 
bc+s Vc+s = .253 
Oric+sVrycrs = .292 


In all cases, we shall continue to as- 
sume that $ = .35, Now, we have 
computed complete predictions for 
the four multiple-cue problems, with- 
out reference to the data of such 
groups. For example, the learning 
curve for F + C problems should be 


bwn = 1 — .35(1 — 265) 


for all trials, x, except the first. This 
prediction can be compared with the 
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Fic. 2. Fitted within-problems learning curves for single cues, and predicted curves for 
combined cues, in Warren's experiment on cue-additivity. Panels 1, 2, and 3 show curves 
fitted by approximation. Panels 4, 5, 6, and 7 show a priori curves. All curves are from 
Equation 19. For parameters and rationale, see text. Data from Warren (21). 


performance of the animals on the 
F + C problems. 

The fitted curves for the three sep- 
arate cue problems, and the predicted 
curves for the combined-cues prob- 
lems, are plotted in Fig. 2. Inspec- 
tion shows that the predictions are 
quite successful, almost as good as the 
fitted curves with which we started. 

The fact that the empirical curves 
do not seem to be the same shape as 
the theoretical curves may be due, at 
least in part, to the fact that empirical 
Curves represent averages over a vari- 
ety of problems of the same type 
which differ in difficulty, and over the 
Performance of different monkeys 
which differed significantly. The 
average of a set of exponential func- 
tions, with different rates, is not itself 
an exponential function (1). The 
estimates of parameters are at best 
tough approximations, but it is en- 
Couraging to see that the predictions 
are at least approximately correct, 


he despite the irregularities of de- 
ail, 


Warren also has studied perform- 
ance of monkeys first trained on an 
F + C +S problem, then retrained 
with some of the same cues present 
but some removed (22). A drop in 
performance was observed whenever 
any cue was removed. The relative 
amount of drop caused by removal of 
various cues, or combinations of cues, 
checks well with our observed param- 
eters. For example, color had the 
greatest effect, more than form and 
size together, and form was slightly 
more important than size. The stim- 
uli used in this experiment were quite 
similar to, though not identical with, 
those used in the experiment on addi- 
tivity (21). 

Theoretically, removal of part of 
the type-b cues would result in re- 
covery from adaptation of certain 
type-c cues which had been adapted 
only with respect to the cues removed. 
The greater the proportion of cues 
removed, the more invalid cues would 
recover, and the more performance 
would drop back toward chance. The 
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quantitative results are approximately 
in accord with expectation, though in 
this case no specific calculations are 
possible. 

The above computations were 

based, in part, on the assumption that 
form, color, and shape cues are addi- 
tive. This means, essentially, that 
the cues are perceptually independent 
and have nothing in common. This 
assumption is supplementary to the 
main theory, but in this case seems 
reasonable. The theory has been 
used as a framework within which to 
state such special hypotheses, and 
test them. The test, of course, is of 
the theory and the special hypothesis 
jointly, and failures of such predic- 
tions may be attributed either to fail- 
ure of the theory or of the special 
hypothesis. This weakens the power 
of such experiments to verify the 
theory. Nevertheless, one must keep 
in mind the distinction between the 
basic theory and the special assump- 
tions, lest the theory be prematurely 
discarded because it is combined with 
an ill-conceived special assumption. 
It is by no means sure that color, 
form, and size are additive cues. 
Brightness and size are probably not 
independent cues, since they have 
the common factor of total flux of 
light. 

A more tenuous perceptual hypoth- 

esis has also been tested, using the 
present theory. The results are inter- 
esting, if inconclusive. Warren (20) 
studied the rate of solution of form dis- 
criminations as a function of the size 
of forms used. In each problem the 
two forms were of equal size, but 
some problems used small forms and 
other problems used larger forms. On 
a 3” X 3” object, forms were of 0.9, 
1.8; 2.7; 3:6, 4.5, and 5.4 sq. in. areas. 
The larger forms were more rapidly 
mastered, indicating that larger forms 
give rise to more valid cues, 


The following quantitative hypoth- 
esis was considered. Suppose that 
the weight or number of cues arising 
from the forms is proportional to the 
area of the forms, while the weight of 
cues arising from sources other than 
the forms is constant. Let I be the 
weight of cues not arising from the 
forms. The I cues are type-c, since 
only the forms are differential in War- 
ren’s experiment. The weight of cues 
arising from the forms is proportional 
to s, the size of the forms. Some of 
these form-cues are valid type-b cues, 
arising from differences between the 
forms. Other form-based cues are 
invalid type-c cues, arising from 
nondifferential characteristics of the 
forms. Let sB be the weight of the 
type-b cues arising from forms of size 
s, and sC be the weight of type-c cues 
arising from forms of size s. The pro- 
portion of type-b cues, when forms are 
of size s, will be 


sB 


"= Bfc [20] 


according to the hypothesis. 

We may estimate 6, for each size 
of forms employed, and see whether 
Equation 20 fits the obtained esti- 
mates. Since only the relative weights 
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mated from learning rate) as a function of size 
of forms, The theoretical function is Equa- 
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of type-b and type-c cues are import- 
ant, we may take Bas our unit. Equa- 
tion 20 was fitted to estimates from 
Warren’s data, with B = 1, C = 1.17, 
and I = 21.0, and the correspondence 
was found to be close (see Fig. 3). 

: This test of the theory is considered 
inconclusive, because the perceptual 
hypothesis is doubtful. It has been 
found that the saliency of a form on a 
background for the monkey depends 
on whether the form is near the edges 
of the object or not (19; and personal 
communication from A. J. Riopelle). 
Since increasing the size of a form 
which is centered on an object of fixed 
size brings the form nearer the edge of 
the object, it seems likely that a more 
complicated perceptual hypothesis 
will be needed. 


DISCUSSION 


This paper has expounded a theory 
of the formation and operation of 

learning sets” in monkeys. Using 
an earlier theory of discriminative 
learning as a basis, it has here been 
assumed that monkeys are capable 
of responding to abstract, contingent 
cues. Monkeys, with extended ex- 
perience, can transcend the concrete 
stimulus characteristics which ordin- 
arily form the basis of discriminative 
responses, and respond to conditional 
cues such as “the object which was 
correct on the previous trial,” etc. 
These contingent cues are analogous 
to the cues, whatever they may be, in 
conditional discriminations—one part 
of the compound being the characteris- 
tics of the object, the other part being 
Previous reinforcements given an 
Perceived by the animal. 

This theory emphasized the cue 
value of rewards. Some motivational 
eNe of reinforcement 18 BRE 
ia ah even in the present theory, 6° 
aa we can define “validity” and “the 

rect response,” but rewards + 


ally must have two effects. One, the 
traditional reinforcement effect, is as 
an agent to strengthen certain re- 
sponses. The other is a cue-value, 
since food rewards are themselves per- 
ceivable cues. 

The importance of rewards as cues 
has been demonstrated in a series of 
papers by Riopelle and his associates 
(14, 17, 18), who have shown that 
highly efficient performance can be 
maintained even when no reward is 
given on the first trial—so long as 
some informational cue, such as a 
marble, is used to indicate which ob- 
ject will be reinforced. In the present 
terminology, they have shown that 
type-a cues, which are usually contin- 
gent on previous rewards, can as well 
be contingent on other things, such 
as marbles, which have the same place 
in the procedure as rewards. It is, of 
course, doubtful that monkeys would 
long perform effectively if no rewards 
at all are given for correct responses, 
since the reinforcing (strengthening) 
property of rewards is essential. But 
on Trial 1, in a problem with a sophis- 
ticated animal, the reward seems to 
function mainly as information, and 
one signal is about as good as another. 
In fact, since raisins seem to capture 
the animal’s whole attention and de- 
tract from observation of the object, 
it was found that the marble signal 
was sometimes more efficient than the 
reward. 

It should be evident that the basic 
concepts of the present formulation 
are not original with the writer, but 
instead are drawn from the discussions 
of writers like Harlow, Meyer, Rio- 
pelle, and Warren. The purpose of 
this study has been to develop a quan- 
titative framework for ideas about 
learning sets which already existed in 
the literature. In the present formu- 
lation these ideas have of course been 


dified, and_the formal relation- 
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ships, such as the mechanisms of con- 
ditioning and adaptation, and the 
relationship between cue-validity and 
rates of learning, are essentially those 
which were stated in previous papers 
by the present writer. Apparently, 
despite the changes and additions, 
there is some correspondence between 
the present theory and the theoretical 
interpretations of learning-set experi- 
menters.® 
The present theoryissharply limited 
as to the variables considered. The 
main factors in performance discussed 
are 0a, the relative weight of the ab- 
stract cues, @, the relative weight of 
the concrete differential cues, and Vz, 
the validity of the type-b cues. Only 
very simple perceptual hypotheses 
were used in testing the theory. The 
main important factor which appears 
to be missing is the possible appear- 
ance of observing responses. Other 
important factors are presumably 
missing, but have not yet been identi- 
fied. 
Tests of the theory are at present 
limited to a few fairly obvious and 
simple computations. Detailed anal- 
ysis of the large body of experimental 
data, as indicated by the present 
theory, may be useful in systematiz- 
ing and clarifying some of the findings 
already available. At the same time, 
extension of the theory to formal 
statements about reversal, oddities, 
and other such problems is indicated. 
It should be noted that the present 
theory does not “explain” learning 
sets, in the sense of reducing the ob- 
served behavior to some simpler type 
of performance. Learning-set per- 
formance has not, for example, been 
shown to be merely another example 


of S-R association, or innate organiza- 


6 At least, discussions „with Harlow, Rio- 
pelle, and Warren have indicated that such 


agreements exist. 


tion. The present theory is descrip- 
tive, attempting to write equations 
which enable one to predict perform- 
ance in certain experimental settings. 
The equations do not arise from sim- 
ple first principles, but are in a real 
sense ad hoc, with the restriction that 
they reduce to an earlier theory of dis- 
crimination learning for a subject’s 
first problem, or for a subject bereft 
of type-a cues. 

Two proper criticisms of the present 
theory may be mentioned here. One 
is that the whole formulation hinges 
on the existence of type-a cues, but 
there is no direct evidence that such 
cues are present. To this the only 
answer is that the cues are hypothet- 
ical, and any hypothesis is accept- 
able until it is either disproved or re- 
placed by a better one. A deeper 
criticism would point out that type-a 
cues are not well described. Several 
crucial questions have been left unan- 
swered: Are such cues innate or 
acquired, and if acquired, when are 
they acquired? How, exactly, are 
such cues to be described, since they 
seem to defy the usual sort of S-R 
analysis? Does the existence of 
type-a cues depend on the phyloge- 
netic level, neural or receptor endow- 
ment, or “intelligence” of the subject 
employed? What kind of neural 
basis would be required? The only 
consequence of the presence of type-a 
cues, as the theory stands, is their 
effect on the performance of sophisti- 
cated monkeys—but certainly such 
performance might arise in any of a 
variety of ways. Answers to such 
questions can be obtained only by 
further investigations. It is hoped 
that the theory in its present form 
can take a place as one of the tools in 
the analysis of the mind of the mon- 
key. Whether or not the tool is use- 
ful can be decided only by test. 
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TRIAL DESIGN IN HUMAN EXPERIMENTS? 


CHARLES W. SLACK 


Harvard University 


With notable exceptions such as free 

operant behavior (12) and some pure 
phenomenological experiments (7), psy- 
chological experiments consist of trials. 
Trials constitute the “building blocks” 
out of which experiments are made, and 
that branch of the science which deals 
with the arrangement of numbers of 
trials under differing conditions and 
over Ss is the study of experimental de- 
sign. It tries to tell us how we can dis- 
tribute trials or blocks of trials so that 
we may get the most out of an experi- 
ment. 

In this paper the author will try to 

show that attention can profitably be 
paid to the ¢rial itself, to its composi- 
tion independent of the particular pa- 
rameters and variables used. Although 
many or most of our experiments are 
made up oftrials, the trials per se have 
not been studied for what they are, and 
the principles of experimental design 
have not been extended to cover what 
goes on “inside” the trial. Trials are 
the “molecules” out of which experi- 
ments are constructed, and the trials 
themselves can be considered as made 
up of elements which are the critical 
events—stimuli, responses, and instruc- 
tions. 

The aim of this paper is threefold: 
(a) To bring attention to the study of 
the design of trials as an important and 
integral branch of experimental design, 
to propose that the trial can profitably 
be studied in its own right as an 
adjunct to psychological theory and 
method; (4) to propose one possible 
model, an “event arrangement” model, 


1 This investigation has been aided in part 
by a grant from the Foundations’ Fund for 
Research in Psychiatry. 
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as being fairly adequate to cover a 
number of interesting traditional and 
novel experiments with human Ss; (c) 
to show, by example with this model, 
how models of trial design can be used 
to generate new experiments, to pro- 
vide a framework within which we can 
discuss experimental results, and, spe- 
cifically, to provide an operational way 
of describing correspondences between 
theoretically derived trial types and ex- 
perimentally obtained similarities and 
differences in response measures, 

If we are correct, trial design could 
emerge as a methodology particularly 
concerned with and suited to the needs 
of psychology and developed for those 
needs rather than borrowed from other 
sciences like agriculture. No other sci- 
ence has experiments made up of trials 
in the same sense as psychology has. 
In most psychological experiments a 
trial is not an arbitrary unit. Psy- 
chological experiments are sequences of 
events, and the fundamental nature of 
the process studied is more contingent 
upon some aspects of the sequence than 
others. We may move some things 
around, and not others. For example, 
in a simple size-constancy experiment 
(Ss judge the height of different sized 
objects at a distance and do this over 
a number of trials), we may trade or 
eliminate trials or blocks of trials, we 
may change variables and parameters 
including instructions, we may run each 
trial on a different S, and still be study- 
ing something which we call perception 
or a “perceptual type” response. How- 
ever, there is a limit to the smallness of 
the unit which we can tamper with and 
still be studying anything like percep- 
tion. If we stimulate one S and get the 
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response from another independent S, or 
if we put the response before the stimu- 
lus in each trial, we have changed the 
whole “quality” or “character” of the 
experiment, and other changes (notably 
in instructions) are necessitated. 


DEFINITIONS 


There are two general groups of char- 
acteristics which make one psychologi- 
cal experiment different from another 
when they are both made up of trials. 
In the first group we have those vari- 
ables which change from trial to trial 
and those parameters which change from 
experiment to experiment (experiment 
= block of trials) ; and, second, we have 
the ordering over time of the occurrence 
of events within the trial, between trials, 
and between blocks of trials or experi- 
ments. Thus, for example, time in the 
sense of interval belongs in the group of 
parameters and variables, but the or- 
dering of events over time is a problem 
of experimental design and is nonpara- 
metric by nature. Traditional experi- 
mental design is concerned with ar- 
rangement effects between trials and be- 
tween blocks of trials. The present 
model of trial design concerns itself with 
arrangement effects within trials. 

The usefulness of trial design, like 
any other methodology, rests upon the 
definitions of its fundamental concepts. 
I have previously rendered an analytic 
a of the impossibility of a water- 
z t a priori distinction between stimu- 
aa and response (11). Complete defi- 

ions of the event-elements which 
make up a trial are impossible without 
ee assumptions regarding the 
We onship between organism and en- 
= oa, Thus the definition of the 
pte oo of a trial can never he a 
tie ly operational procedure, if by ; op- 
an a Sg we mean that the definition 
Obtas e made before empirical data are 

ained, 

The distinction which we, as Es, make 


between stimulus and instructions, for 
example, is never really clear in any 
particular case (11) and, furthermore, 
in the case of multiple-event trials it is 
very difficult at times to state before- 
hand how many events of a particular 
sort we are really dealing with. As we 
shall see, however, the purposes of trial 
design do not really demand that we be 
able to give precise definitions of the 
components—only that we be able to 
distinguish them well enough to order 
them in the trial. 

Trials. Trials are unit parts of ex- 
periments which are blocks of trials. A 
trial is the smallest consecutive se- 
quence of event-elements which is re- 
peatable and which contains at least 
one of each of the critical event-ele- 
ments (stimulus, instruction, and re- 
sponse). A trial cannot be broken 
down any further or otherwise altered 
without altering the character of the 
experiment, i.e., without changing a 
perceptual type of experiment into a 
learning experiment. 

The event-elements within a trial re- 
fer to each other more than they do to 
other elements outside the trial. Two 
event-elements are said to refer to each 
other (and hence belong in the same 
trial) provided a main effect of the ex- 
periment would be discovered by ob- 
serving the relationship between those 
events. Thus, in an ISR trial the 
stimulus and the response within the 
trial are said to refer to each other, 
since the relationship between these two 
in effect of a perception 


events is a mal 
experiment, whereas the relationship be- 


tween, say, the stimulus of this trial 
and the response of the preceding trial 
is not a perceptual effect. 

Reference is not the same thing as 
cause and effect, however, since a re- 
sponse may intimately refer to a stimu- 
lis yet to come and may, therefore, be 
neither the cause nor the effect of that 
stimulus. In this case the response is 
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a predictive response and might, ac- 
cording to some theory, be “caused” by 
a previous stimulus not a part of the 
present trial. 

Events Which Compose Trials, Trial 
design concerns itself with the composi- 
tion of the trial. In the present model 
composition means the combination and 
permutation of event-elements, The 
critical events which make up a trial 
are, according to the model, stimuli 
(S’s), responses (R’s), and instructions 
(T’s). Although it may be an advan- 
tage in psychological theory to consider 
instructions as no different from stimuli, 
in designing trials there js great ad- 
vantage in considering them as hay- 
ing properties somewhat different from 
stimuli. 


S, R, and I are defined as critical 
changes in what we know to be ongoing 
processes. The notion of “change” or 
differential is necessary since the model 
is an event-arrangement model; and if 
an event is continuous throughout a 
trial or an experiment, it cannot be 
arranged or moved around. Otherwise, 
the use of S, R, and I does not differ 
in any important way from many of the 
most common usages in the literature. 

Stimulus. S is a change within a trial 
in the stimulating conditions, It is im- 
portant that reinforcements in learning 
trials are always S’s, as are perceptual 
and auditory displays, test items, etc. 
S may be defined in either psychologi- 
cal or physical terms depending upon 
the demands of the situation or the bias 

of the investigator. 

Response. We know that the organ- 
ism is always responding in some sense 
or other. Certain changes or rates of 
change in the R process are interesting 
and relevant, whereas others are not. 
An R is a critical differential in the out- 
put of the organism during a trial, 

Instruction. I’s are a kind of “con- 
ditioning” or super-determining stimuli 
which have the psychological property 
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of orienting or setting the organism for 
R’s (or sometimes for S’s) to come. I’s 
refer to and must precede R’s, since 
they must be not incapable of altering 
the R to- which they refer. However, 
R’s may precede S’s, in which case they 
are predictive R’s. I’s are often as- 
sumed to be unequivocal, often their 
physical Properties are more trivial than 
S’s, and often they are given in writ- 
ten and spoken words. None of these 
last characteristics is necessary, how- 
ever. Reinforcements are never consid- 
ered as I’s—even when verbally deliv- 


ered. Hence the phrase “You have 
done very well,” is, according to the 
model, an S. 


THE EVENT-ARRANGEMENT Mover 

The event-arrangement model works 
in the following way: by combining and 
permuting the elements S, R, and T 
and adhering to the restrictions, trials 
of varying complexity can be generated. 
The model provides a way of breaking 
down complex trials into simpler trials 
and of building up ideal trials out of 
component trials. All of this is done 
without directly considering the vari- 
ables of the experiments. Considera- 
tion for variables like time can be in- 
troduced later, By deducing all of the 
possible trial varieties central to a par- 
ticular Problem, issues take on a new 
light. Trials are thought of as controls 
for one another, and answers to ques- 
tions are possible, 

If it is difficult for the reader to con- 
sider trials in the abstract without know- 
ing what the variables are, he can be 
aided by the following device. Imagine 
a very simple experiment in which the 
S varies in two ways with two values 
to each way (long, short: red, black) ; 
the R can be one and only one of these 
four. The I tells the S to attend to one 
of the two aspects (length or color). 
Now if we run the trials in the order 
ISR, we are studying “perception,” and 
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if S varies randomly and I is constant 
(“What length is it?”), we can be sure 
that any correlation between S and R 
over repetitions of the trial is due to 
perceptual” accuracy, and we can de- 
fine perceptual accuracy by that cor- 
relation. It is a measure of “how much 
the S sees.” If we run the trials IRS, 
we are studying learning, and if I is 
constant (“What length will it be?”) 
and S varies in a learnable manner, we 
can be sure that any within-trial S-R 
Correlation reflects learning—“how much 
the S has learned.” If we run the 
trials SIR, we are studying something 
Conceivably having to do with percep- 
tion or memory more than learning, but 
how it differs from ISR is not obvious. 
However, SIR suggests that we should 
vary I, and if S is constant and I varies 
randomly (sometimes length, sometimes 
Color) , we might say that I-R correla- 
aon measures the degree to which the 
> can change set. With some adapta- 
tions which are obvious in each case, 
One can extend this example with its 
Simple variables to the complex trials. 


SINGLE Event TRIALS 


oem are six permutations of the 

of fh gross” events I, S, and R. Three 

i ese are not consistent with the re- 
riction that I’s must refer to an R 

Hes the same trial and must “set” 
e organism for that R.2 To do this, 
Must precede R. 


2 
pay “inconsistent” trial types are: SRI, 
Renee RIS. SRI is an “unconditioned 
less I se” trial, a simple S-R trial with a use- 
tri at the end, The RSI trial and the RIS 
sel Would be “operant” trials also with use- 
clude F If we stretch the notion of I to in- 
i th hose which “act like” reinforcements 
e model, all reinforcements are consid- 
Verbally S’s whether or not they are given 
meee We are tempted to make something 
Mode] thee trials. However, according to the 
incapab] ey are not trials because the I’s are 
refer Ple of altering the R’s to which they 
Bet jpsPethaps the I’s “remember it” or “for- 
Would be an exception, since they refer 


By elimination we have three con- 
sistent single event trials: ISR, IRS, 
and SIR. These three paradigms are 
all used fairly frequently in psychologi- 
cal experiments; as a matter of fact, 
together with the ISRS trial discussed 
later, they practically exhaust the pres- 
ently explored trial varieties. 

The ISR paradigm is the typical per- 
ception, discrimination, and sensation 
paradigm. It comes to us from psycho- 
physics and is incorporated in such a 
diversity of “experiments” as mental 
tests, projective tests, constancy experi- 
ments, absolute and difference threshold 
experiments, scaling experiments—in- 
deed, all those experiments where the 
main effects are processes which do not 
change much with time or experience. 
There is no feedback in this trial para- 
digm, the S has no knowledge of results 
within the trial. 

The IRS paradigm is one learning 
and concept formation trial used to 
study those processes which do change 
with feedback or experience, where we 
expect learning to occur. In this trial 
the organism makes an R before there 
is a significant S change and then gets 
feedback in the form of “a reinforcing 
stimulus” (discovers whether or not he 
was right, is rewarded or not, punished 
or not, etc.). This trial is the Hum- 
phreys’ paradigm trial (5) used by 
Bush and Mosteller (2) and is the sim- 
plest trial type in which learning is a 
main effect. 

The SIR trial is very much like the 
ISR trial; in those cases where $ does 
not cease before I is presented, it may 
yield identical results in R. In the case 
where there is no overlap in time be- 


ses 
backwards) and are, therefore, either not Is 
as defined by the model or are not a part of 
the trial to which they are supposed to be- 
long. The model places this restriction on Is 
and only on I’s: S’s and R’s may come any- 
where in a trial, but, if there is an I (and 
there must be at least one per trial), it must 
precede all R’s in that trial. 
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tween S and I, however, the differences 
in R observed between these two types 
of trials may be instructive. These dif- 
ferences are attributable to character- 
istics of set or attitude, “tuning the or- 
ganism,” etc. Experiments where I’s 
both precede and follow S’s have been 
used to demonstrate the existence of 
“central perceptual selective processes” 
as distinguished from “response” tend- 
encies and are important in studying 
search behavior. In a “perceptual se- 
lection” experiment by M. Lawrence * 
(9) S’s were presented, each of which 
contained a lot of various numbers, let- 
ters, and symbols. The S was a tachis- 
toscopic exposure of a projected slide. 
In some trials I’s to “Look for numbers 
(letters or symbols)” preceded the brief 
presentation of the slide, and in some 
‘trials the I’s “Write down all the num- 
bers you saw” followed the S. Since 
Ss were better at selecting the correct 
elements in the ISR condition than 
they were in the SIR condition, and 
since the presentation of the S’s was too 
brief to allow eye movements (search 
behavior), the difference in results could 
be attributable to ‘a difference in a 
central perceptual (selective) readiness 
process, or to a rapidly declining per- 
ceptual “trace.” Whether the ISR-SIR 
differences are due to forgetting or to 
the presence of the “state of readiness” 
could not be distinguished by this sim- 
ple design. 


THE IDEAL TRIALS 


The SIR-ISR difference brings up 
an important point about the ideal trial 
for any particular experiment. The 
ideal trial is a kind of a statement of 
an experiment, at the first level, in the 


3 Not to be confused with an experiment by 
Douglas Lawrence and George Coles (8), who 
used the same trial types but employed a 
recognition procedure, and hence a very sim- 
ple stimulus, and found no differences between 
ISR and SIR. 
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shortest possible form. Theoretically, 
any experiment should consist of an 
ideal trial and all the controls which 
are necessary. The necessary controls 
are completely derivable from the ideal 
statement by considering all possible 
sequences within the ideal trial which 
obey the rule: at least one of each ele- 
ment must be present in a sequence be- 
fore it can be considered a trial. 

Thus the ideal trial for the M. Law- 
rence-type experiment for studying the 
effect of change in position of I’s rela- 
tive to S is ISIR. For studying the 
position of the S’s relative to I it would 
be SISR. From either of these two 
ideal trials we may derive the two con- 
trols of SIR and ISR. 

The ISIR trial. In actual practice, 
this trial would at times (probably half 
the time) demand S’s switching his set 
before making a response: “Look for 
numbers”—exposure of S—“Write down 
letters”—R. Some of the effects might 
wash out under these conditions, since 
S would learn not to trust E. In any 
case, before the paradigm has been run 
in its ideal form, we remain ignorant of 
much concerning this process. 

The SISR trial. The fact that this 
ideal trial is possible shows that there 
are: really two alternative theoretical 
sets contained in the I’s in this experi- 
ment, one referring forward and one re- 
ferring backward to the S. As long as 
the trials SIR and ISR are run sepa- 
rately, this fact is obscured. For the 
SISR trial to make sense, I must ex- 
plicitly state to which § the R is sup- 
posed to refer. In practice, this trial 
would sound like: (a) exposure of S— 
“Write down the numbers you have 
seen (ignoring the next S)”—second 
exposure—R; or (b) exposure of s— 


4 Where differences between events of the 
same class are not considered, C= (2 “H 
(2—1)(2"—1) —1, where C= number Hes 
control trials and s, i, and r are the num! 
of S’s, I’s, and R’s in the ideal trial. 
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“Write down the numbers you are about 
to see (ignoring the last S)”—second 
exposure—R. 

The analysis of an experiment, in this 
way, forces us to discover many of it po- 
tentialities and limitations. The “either- 
or” characteristic of this paradigm, as 
we have developed it, is evident. That 
is, the R is restricted at any time to 
one and only one part of one and only 
one S, which must be considered as a 
limitation. It is a study of the ability 
of S to follow singular and exclusive di- 
rections, which come at one of two po- 
Sitions in the trial and which instruct 
him to attend to one of several aspects 
of one of two S’s occurring at different 
Positions. In the form deduced from 
the model it is a much more complete 
and exhaustive experiment and tells us 
much more about the process of selec- 
tive set. 

__The complete experiments of two 
ideal trials and two control trials can 
answer the following questions, among 
Others: 
Ta If Ss are better on ISR than on 
en then there exists a central selec- 
ive state of readiness or a forgetting 
ee or both. (These cannot be dis- 
ened by studying just these two 
rials separately.) 
ba it in ISIR, Ss are better when 
eea i I is the same in content as the 
ny I (eg, both are “numbers”) 
fet Pas the I’s are different, this 1s 
of ue to a declining trace (although, 
Nees there may be a declining 
vary which would best be studied by 
è ne time as a parameter). It may 
ing ue to the fact that the same I is be- 
5 Bea ol and thus remembered bet- 
son This can be checked by comparl- 
titi with ISR where there is no Tepe 
ion, 
oe Ss are no better, in ISIR, 
they the T’s are the same than when 
ong ae different, this may be due to 
two causes. First, effects may 


wash out due to the inclusion -of multi- 
ple I’s (Ss may not pay atterition to the 
first I after a while); this can be ascer- 
tained by comparison with SIR and 
ISR. Second, if there is no perceptual 
state of readiness operating, the main 
effects are due to a declining trace. 

4. The SISR trial enables us to see 
whether it is where the I occurs or in 
which direction it refers which is criti- 
cal, When I refers to the first S, re- 
sults should be identical with SIR. 
When I refers to the second S, results 
should be identical with ISR. Any de- 
partures which obtain are due to the 
presence of the “yninstructed” S. 


MULTIPLE Event TRIALS 


We will now turn to some trials which 
can be generated by the introduction of 
more than one of each of the elements 
S, I, and R. 

An example of a multiple R trial. A 
recent study by Eriksen (3) is a good 
example of the appropriate use of a 
multiple R trial. Ss were presented 
with a size-judgment task under the 
method of absolute judgment. Instead 
of using only one R or judgment to 
each S presentation, Æ varied the num- 


ber and nature of the judgmental R’s. 


Ss were required first to make a verbal 
hen to move a lever 


absolute judgment, t 
tionate to the size 


to a distance propor 
of the presented S, and then to make a 
“second guess” absolute judgment, ver- 
bally. The experiment was designed to 
find out whether there was an independ- 
ent relation between each of the several 
R systems and the S, or whether all 
of the R’s would function together. It 
might be that the first verbal judgment 
would serve to define the perception for 
the S, the lever R and the second verbal 
judgment being mere reflections of the 
first judgment without any independent 
relation to the S. Phrased statistically, 
the question becomes one of determin- 
ing whether there exist significant par- 
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tial correlations over trials between 
each of the R systems and the S when 
the remaining R systems are held con- 
stant. The results showed that the first 
and second verbal R’s and the move- 
ment R each had a significant correla- 
tion with the S’s when either or both of 
the other R systems were partialed out 
and that there Was some independent 
error in each of the R’s, since all of the 
R systems together gave a bettei* pic- 
ture of the S than any two. Consider- 
ing that the second verbal judgment 
Was the same as the first, except for the 
fact that it came after the first and 
after the lever-moving judgment, these 
results are, in a wa: » Surprising. Erik- 
sen concludes: 


One factor that might constitute at least 
part of this noncorrelated error is a dif- 
ference in judgmental task that confronts 
the O at the time of the second verbal 
judgment. For the first judgment O must 
attempt to deal with a continuum of size 
and to place the stimulus in one of 11 cate- 
gories along this continuum, However, at 
the end of the second judgment O is faced 
with essentially a two-category judgment. 
He has already located the stimulus along 
the size dimension and his judgment now 
is in terms of whether the stimulus is apt 
to be larger or smaller than his first judg- 
ment. This difference in judgmental task 
may be one of the factors subsumed under 
the noncorrelated error term (3, p. 358). 


Eriksen is tentatively proposing the 
possibility of the discovery of a very 
interesting effect: the way in which a 
complex R is set up and distributed 
over time is a condition of the accuracy. 
We might interpret Eriksen’s suggestion 
as that of the presence of something like 
a “scanning mechanism.” The S does 
better if the R is distributed in such a 
way that he can narrow his search in a 
systematic fashion—first a gross R, then 
a finer adjustment, etc. 
The event-arrangement model can be 
used to generate a set of controls for 
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testing the existence of a “contingent- 
R” mechanism suggested by Eriksen. 
The basic experimental trial is a double 
R trial much like Eriksen’s, except that 
we have eliminated the second or “lever” 
R and replaced it with “intervening ac- 
tivity.” We also force the S to do what 
Eriksen suspects he is doing on his 
second R, that is, to “correct” his first 
R. The S always makes his second R 
using his first R as a baseline, 

The R’s in this experiment differ with 
regard to the number of alternatives po- 
tential in each. Furthermore, any R 
may be present or absent at two places 
relative to the intervening activity. We 
are thus free to combine our two R’s 
into a single R (equal, in this Case, to 
the product of the alternatives in the 
separate R’s), and to Tun single and 
double R trials for each of the different 
R’s, 

It is believed that this Paradigm con- 
tains a set of controls to account for 
Possibilities not inherent in Eriksen’s 
design. A plan for the experiment is 
given in Table 1, 

Not all of the controls are absolutely 
essential to show the existence of “some- 
thing interesting,” but, if the results of 
this kind of experiment produced any 


TABLE 1 
DESICN For A PERCEPTION Exprrment * 


ist 2nd 3rd 4th 
ime | Time | Time Time 
Period | Period | Period Period 
1. Ideal trial type S(X) | R(A) r |R(B) 
2, Control trial type Sx) oO r R(AB) 
3. Control trial type | S X) | R(A) r jO 
4. Control trial type | S(X) | O r |R(B) 
5. Control trial type S(X) | R(A) r Rig} 
6. Control trial type S(X) | R(B) r  |R(B) 
7. Control trial type S(X) | R(A2) Fr 10 
8. Control trial type | §(X) | O r [R(B3) 
And the transpose of | S(x) R(B) r |{R(A) 
each of the above 
except 5 and 6 


Presentation of one of X pos- 
for example, S»A; R(A) =a 
response of one of A possibilities; R(B) =a response 
of one of B possibilities where B is significantly <A; 
R(AB) =a response of one of A X B possibilities; 
r = intervening activity; O = no activity. 


p 


IAE 
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consistent hierarchy and equivalence of 
the trial types in terms of subjective ac- 
curacy, it would enable us to go a long 
way in specifying the type of mecha- 
nism involved in the handling of in- 
coming information. It might turn out, 
for example, that any split-up of the R 
is better than no split-up at all; on the 
other hand, it is possible that only a 
split-up in which there are significantly 
fewer (or perhaps even more) cate- 
Sories in the second R than in the first 
R will yield a higher correlation with 
the S. We already know that Ss do 
better when they have more R-alter- 
Natives in which to express themselves. 
x No other substantial difference is ob- 
ained between these trial types in the 
Correlation with the R, then we can 
fone that, in the particular situation 
É vestigated, this multiple IR trial can 
Er reduced to a single IR trial, and 
ite problems, including questions of 

© appropriate definition of S, I, and 
ee be greatly simplified, if not 

On a different but related topic, Post- 
man has said: 


ing. it appears that an inference concern- 
orp Perceptual discrimination can be made 
of volt it is based on at least two classes 
tion Foe Thus indices of discrimina- 
Produ MA on the analysis of subjects’ re- 
ures aeons should be validated by meas- 
rat he from other procedures such 
Rania coe of recognition. Similarly, 
Verba] Is determined from distribution of 
Measy responses may be supplemented by 
on. ie ased on matching responses, and 
Stimulus olding the manipulations of the 
ONstant and the tuning of the organism 
a ems variations in at least 
Xtent et of restricted responses. To the 
€asur at the two (or more) methods of 
Concer nt lead to consistent conclusions 
have Saal subject’s discrimination we 
BSitigg fed a perceptual response dis- 
SPonse ¢ The larger the number of re- 
discrimi, lasses which we use in our tests of 
ination, the more secure are we in 


our inference of perceptual response dis- 
positions. It is not to be expected that 
two or more classes of responses governed 
by the same disposition will yield identical 
indices of discriminations, such as thresh- 
olds or constant errors. If the indices dif- 
fer, the analytic problem is to divide the 
observed systematic variance in responses 
into two major components: that due to 
the common perceptual disposition and that 
contributed by the availability or prob- 
ability of the specific responses used (10, 


p. 65). 


The multiple IR trial looks promis- 
ing as an instrument for getting the 
kind of data which Postman feels is 
necessary to define a perceptual re- 
sponse disposition. 

The 1,RyloRo. . .I„R,S trial. Eriksen’s 
experiment and Postman’s statement 
suggest a wide application for that type 
of trial where a number of R’s, each 
with its own I, follow the S. Basically, 
this trial is still a perceptual trial, since 
R’s, even though there are a great many 
of them, are still made in the presence 
of (or following presentation of) the S. 
Two further possibilities immediately 
arise. First, would there be any ad- 
vantage to the multiple IR series in the 
learning trial? A trial might then con- 
sist of having the S predict, in a va- 
riety of modes and/or the same mode 
distributed in various ways, what he 
expects the S to be. An I,R,IoRo... 
J,RnS trial would allow us to study 
learning under those conditions where 
there may be a lot of R error which 
does not affect the learning process as 
far as the S is concerned (nonpercep- 
tual error), but which otherwise would 
get in the way of our understanding of 
the learning process itself. The author 
wishes to make this suggestion very 
tentatively, however, since a number of 
problems arise which are too difficult 
for consideration here. Instead we shall 
move on to the second possibility, a 
multiple IR trial which combines the 
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potential for learning and perception, or 
discrimination, in the same trial. 

The IRSR trial. The IRSR trial is 
one in which the S is asked to make 
both a predictive or learning R and an 
antecedent, perceptual or discrimination 
R to the same S. Where there is S-R 
error in both of these systems, the trial 
paradigm is not trivial. From the point 
of view of learning, for example, we can 

ask the question: What happens to 
learning when the reinforcement is un- 
clear (when there is error in the dis- 
crimination of the reinforcing S)? The 
two- or three-choice learning paradigm 
(2), in which the S is asked to predict 
whether the S is going to be, say, red 
or black, can now be extended to those 
situations where the S is also unclear as 
to the outcome of his prediction, but 
where the E has an S-R measure of this 
lack of clarity. We tend to think of re- 
inforcing S’s as varying in strength, but 
they also vary in discriminability, and 
this is an important characteristic to be 
studied. 

The two control trials which can be 
derived from the IRSR trial are IRS 
and ISR. A simple example of the use 
of these three trials to study the rela- 
tionship between perception and learn- 
ing might be the following one, in which 
only two values of S are studied at one 
time. Let us say that, in any trial, S 
may have one of two values; that is, it 
may be big or little, the S being asked 
to state (or predict, depending on the 
trial used) which of the two values 
has been (or will be) shown to him. 
Throughout the experiment one value of 
S occurs at random more frequently 
than the other from trial to trial, so 

that there will be something for the S 
to learn. 

First the ISR trial is studied, and a 
psychometric function can be derived 
where the difference between the two 
values of S is the variable. We will 
then know what percentage of the time 
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the S can be expected to distinguish be- 
tween the two S’s in an “absolute judg- 
ment” sense, given the physical values 
of the two S’s. We then run the IRS 
trial, asking the S to predict which of 
the two values will occur (big or little), 
and we do this at selected points on 
the psychometric function—for example, 
where the S can distinguish 90%, 50%, 
and 10%. Of course what the S can be 
expected to “learn” is that one of the 
values of S is more frequent. We can 
then run the combined IRSR trial to 
see if there is any difference when it is 
run as a whole. 

Such an experiment can be expected 
to yield important consequences for 
studying the relationship between per- 
ception and learning. Can learning oc- 
cur when discrimination is very low, if 
there are enough trials? Does the na- 
ture of the process shift as the reward 
becomes unclear? Must an event be 
highly distinguishable for it to act as 
a reinforcement? 

From the point of view of perception, 
the IRSR trial seems to fit current per- 
ception theory much better than the 
traditional ISR trial. Current percep- 
tion theory stresses such factors as set, 
directive state, assumptions, expectan- 
cies, determining tendencies, etc., all of 
which are characteristic of the organ- 
ism before stimulation. The traditional 
paradigm allows us to vary these “set” 
factors only as parameters which can be 
changed from experiment to experiment, 
usually through I’s. The IRSR trial 
gives us an R measure (the first R) 
which corresponds very closely to set, 
and an opportunity to study changes in 
set which may occur from trial to trial. 
Recently there have been suggestions in 
the perception literature (1, 4) that “re- 
sponses in the absence of stimuli” will 
decide nativism-empiricism arguments. 

Two interesting correlations may evi- 
dence themselves in data collected with 
the IRSR trial which cannot be ob- 
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tained with simpler perceptual or learn- 
ing trials. The first of these is the cor- 
relation between the two R’s within the 
trial. The question can be asked: “How 
much does an individual’s directive state, 
set, or predictive R tend to determine 
his perceptual R in the case where the 
other correlations between the two R’s 
and the S are zero, or partialed out?” 
This is the familiar question of the in- 
fluence of set, which bears a great deal 
of similarity to the problem of projec- 
tion. It is also the same as the gen- 
eral question of how much perception is 
ew by what the organism brings 
ss e situation, as opposed to critical S 
at ors. It is traditional in psychologi- 
experiments to study the influence of 
set and motivation factors by varying 
"a T's and by varying differences 
ita individuals by selection pro- 
a ures. It is conceivable that changes 
Del occur from trial to trial due to 
a experiences of the S, as well as from 
faa to experiment due to altera- 
ie a Ts and the like. An R meas- 
S of set is necessary in order to dis- 

ver these phenomena if they exist. 

ie ISRS trial. The ISRS trial has 
the Tee control trials (ISR, IRS) as 
can SR trial and therefore can be 
hs ered both from a perceptual stand- 
elthe and from a learning standpoint; 
= as a basically perceptual trial 
Bean added S at the end or as a 
Endl S trial with a preceding condi- 
trial. it > Considered as a learning 
ee is the paradigm of the paired 
ae or Thorndikian (13) esperi- 
vi hia’ he a psychophysical trial, it pro: 

AA e S with “knowledge-of-results. 
Conse cases where there are large and 
. ent errors in the beginning of 4 
Sh ag knowledge of results in the 
eee an appended S makes a differ- 
me in the sense that the S’s R’s be- 
Probes e accurate over trials—are 
able ae considered as studies of learn- 
ills. In many cases, notably in 
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psychophysics, knowledge of results does 
not make much difference. Here we 
tend to feel that we are studying sensa- 
tion or a perceptual process with little 
consistent error, in which learning either 
plays no part or will play no further 
part, the process having reached a 
steady state. 

Some very interesting perception ex- 
periments have been done using this 
sort of knowledge-of-results trial. The 
Ames Demonstrations (6) are presented 
in such a manner that the S is first 
stimulated by a retinal image leading 
to a perceptual R, which R, although it 
has a strong feeling of surety attached 
to it, turns out upon further experience 
by the S to have very little correlation 
with the physical S. Ss register a high 
degree of surprise under these circum- 
stances, and this experiment is very up- 
setting for some of them. With a trial 
such as this, one would expect that 
learning might take place with repeated 
presentations. If learning can be shown 
to take place under these circumstances, 
it is very important for perceptual the- 
ory, since what we are dealing with 
here is a very stable percept of which 
the S is extremely sure and which sub- 
jectively feels as though it is highly de- 
pendent upon the physical S. If it can 
be shown that through repeated pres- 
entations of this trial a change can oc- 
cur in that percept, holding other things 
constant, then questions regarding the 
extreme adaptability of the perceptual 

rocess are automatically raised. Kil- 
patrick (6) has attempted to show that 
such learning does indeed take place. 
His Ss were presented with a room 
which looked normal, but which was 
actually distorted in such a way that 
the retinal image was almost identical 
to that produced by a normal room. 
The R of the S was a motor one in 
which he poked around the room with a 
stick or tossed balls at spots of light 
projected on a wall of the room. These 
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actions led to a feedback of a sort very 
unexpected and discomforting to the S. 
In observing changes in the S’s percep- 
tion of the shape of the room, Kil- 
patrick concludes that there are two 
processes: first, a change in the relative 
weights given to the cues, and second, 
a complete reinterpretation and reor- 
ganization of the cues themselves, so 
that, when now presented with a simi- 
lar normal room, the Ss will see it as 
distorted. The ISRS trial is an impor- 
tant tool in the study of the relation- 
ship between learning and perception, 
since, like the IRSR trial, it provides 
opportunities to study change in behay- 
ior over trials (learning) in what some- 
times is considered an unchanging (per- 
ceptual) process. 


SUMMARY 


A model involving the combination 
and permutation of the occurrence of 
events within the experimental trial was 
suggested which enables one to generate 
a variety of trial types. The events 
which make up a trial were proposed 
as stimuli, instructions, and responses. 
The definitions of these events were 
adequate enough to permit the making 
of certain restrictions on the occurrence 
of events so that a number of trials, 
some the same as and some different 
from those already in use, could be de- 
scribed and their potentialities discussed. 
The values of the study of trial design 
lie in its use as an aid in the concep- 
tualization of experiments, particularly 
in the generation of new experiments 
and in the providing of a framework 


within which we can fit experimental 
findings. 
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When Hume spoke of composite ideas, 
such as table or chair, as being com- 
pounded from simple ideas, such as 
brown or hard, he found himself facing 
the question of how one simple idea 
could resemble a second one more than 
a third one. He could handle the ques- 
tion of similarity between composite 
ideas easily enough; the degree of simi- 
larity of two composite ideas depended 
on the number of simple ideas they had 
i common. But what about similarity 
oe simple ideas themselves? What 
Pa it mean to say blue is more similar 
© green than to red? Hume could not 
ee this beyond suggesting that simi- 
arity between simple ideas is somehow 
given to us directly—a capitulation that 
could never satisfy a British empiricist 
(1; 11, pp. 585-689). 
s Let us take Hume’s predicament as 
ur starting point. Whence does his 
een arise? Its origin lies in two 
eons: a failure to note the full 
aes of attributes in terms of which 
> jects and events can be grouped and 
failure to separate the question of 
eenia similarity from that of psy- 
nene cat similarity. Consider the first 
mace What are the attributes 1m 
$ s of which objects and events can 
EPa or separated? Hume lim- 
i himself here—for a reason noted 
elow—to considering simple sensory at- 


1 
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ag the preparation of this paper. Thanks 
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Script er for critical readings of the manu- 
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tributes displayed by the object itself— 
for example, the brownness or hardness 
of a table. But a table can be defined 
in terms of many more attributes than 
these. To mention but a few possibili- 
ties, it can be defined in terms of use 
(the things done upon it), of location 
(the places where it is found), or of 
construction (how it is built). Such at- 
tributes as these are no less reducible to 
a sensory basis than the ones mentioned 
by Hume, and they suggest a possible 
answer to Hume’s question. We may 
say green seems more similar to blue 
than to red because there indeed are 
more attribute-values that green and 
blue but not red have in common, while 
there are fewer that green and red but 
not blue have in common. Hume may, 
that is to say, be able to retain his com- 
mon attribute view of similarity by ac- 
cepting a wider definition of the term 
“attribute.” ‘Thus, for instance, green 
and blue—but not red—have in com- 
mon the attribute-value of being colors 
of grass in the country, colors of water 
and hence cool colors, summer land- 
scape colors, and so on. 

But we have not fully answered the 
question yet. Some people may not 
say that green and blue are more simi- 
lar than green and red. The attribu- 
tive basis for a judgment of similarity 
may be present and yet the judgment 
not be made; and, on the other hand, 
an attributive basis for such a judgment 
may be lacking and yet the items may 
be judged similar. There would seem 
to be a difference, then, between poten- 
tial similarity and psychological simi- 
larity. Part of Hume's difficulty seems 
to arise from equating the two. This 
leads him to fill the perceiver’s head 
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with similacral (i.e., photographic) rep- 
resentations of environmental attribute- 
values, and hence to ignore the possi- 
bility that the observer may perform a 
selecting and ignoring function with re- 
gard to attribute-values which are envi- 
ronmentally available. Indeed, it is this 
view that leads him to the first omis- 
sion we noted—his failure to specify the 
full range of environmental attributes. 
For insofar as one assumes the organ- 
ism to register in similacral fashion 
what the environment offers, one must 
conceive of that environment as rela- 
tively meager in its offerings, or else the 
organism would be swamped with infor- 
mation. If, however, one conceives of 
the organism as selecting some attri- 
butes by which to compare items and 
ignoring other attributes, then it be- 
comes reasonable to suggest a differ- 
ence between potential and psychologi- 
cal similarity. Potential similarity may 
well be most usefully measured in terms 
of the number of common environmen- 
tal features that two objects or events 
are found to display—using the broad 
definition of attribute noted above, and 
determining the commonalities by re- 
liable means.” 

This definition has its basis in the old 
view of similarity as common stimulus 
elements, to be sure—but it is very dif- 
ferent from that view, since the com- 
mon-stimulus-elements approach used 
the narrow definition of attribute for 
which we criticized Hume, and that 
view was thought to apply to psycho- 
logical similarity, which we shall see is 
often not the case. Establishment of 
the degree of potential similarity be- 
tween two things provides us with an 


2The problem of just what means to use 
and how, has been treated at length by Bruns- 
wik (3) in his discussions of ecological va- 
lidity. The means are, basically, responses of 
a sample of judges—and they may be re- 
sponses as varied as reading a light meter or 
judging a painting’s colors to be warm or 
cool. The validation, then, is consensual in 


nature. 
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important baseline: It lets us validate 
or contravalidate individual reports of 
similarity. A person may, for instance, 
report that two problems are similar in 
the sense that the same solution is 
thought to be applicable to both; to 
ask whether this feeling of similarity is 
valid is to ask whether the same solu- 
tion is in fact applicable to both prob- 
lems; are they potentially similar in the 
sense of having this property in com- 
mon? Or a person may report that two 
objects are similar in the sense that 
both are thought usable for driving a 
nail into a board. To validate or con- 
travalidate this impression of similarity, 
we determine whether both objects can 
in fact be used this way. 

Broadening the definition of “attri- 
bute? and recognizing that potential 
and psychological similarity are distinct 
but related issues, suggest that the ques- 
tion of psychological similarity may lie 
at the basis of diverse kinds of psycho- 
logical research on cognition that have 
hitherto usually been considered apart 
from each other—ranging from studies 
on stimulus generalization to studies on 
learning and thinking. We submit it 
may be fruitful to inquire how some of 
this research bears on the issues we have 
raised, and this for two reasons. First 
of all, we have not yet examined the 
possible ways in which psychological 
similarity might be defined, although we 
have suggested what seems to us a ten- 
able definition for potential similarity. 
Insofar as this research really bears on 
psychological similarity, it may help us 
weigh alternative definitions of the term. 
Secondly, such exploration of definitions 
may well have experimental implications 
for further research in these diverse 
areas. 


Some DEFINITIONS OF PsycHo- 
LOGICAL SIMILARITY 


One may suggest at least four v 
of defining psychological similarity. T 
first derives from Hume’s approac® 
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Hume, in equating potential and psy- 
chological similarity, supposes that if 
common properties are present in the 
environment, they will be perceived by 
the person. Hence, impressions of simi- 
larity can be directly controlled by vary- 
ing the environment’s potential similar- 
ity in the sense of the common values 
of attributes it displays. The selecting 
and ignoring capacities of the organism 
are neglected, and psychological similar- 
ity is defined in terms of common envi- 
ronmental properties. An alternative ap- 
proach is to recognize the organism’s 
selective functions, and suggest that 
recognition of common environmental 
Properties depends upon making a com- 
mon response to instances that share 
this attribute-value. The attribute- 
oe may be environmentally present 
two items and yet the person may 
coe offer a common response to them; 
r the attribute-value may be absent 
and yet a common response be made. 
sg is not necessarily a direct rela- 
ee between psychological and poten- 
he similarity. Psychological similarity 
DR then, is defined in terms of com- 
$ n responses. Yet another possible 
“nian is to suggest there are neural 
a ces laid down along various dimen- 
ons when a stimulus impinges, and 
| ological similarity depends on how 
«he new stimulus is from the old 
iy ulus on such a dimension—the view 

primary stimulation gradients. 
7 “ fourth view may also be proposed. 
en to be sure, to the view of 
tate ee responses, but seeks to sepa- 
diy § e process of recognizing similar- 
Vet E making a common response. 
stimul fourth approach is not that of 
Sune T ion gradients, for it does not as- 
relatio at there is necessarily an inverse 
Ree nship between degree of recog- 
con oe A and distance of new 
he l stimuli along some dimension. 
fo » ees may be put somewhat as 
may ss Although a common response 
e forthcoming to signal the recog- 
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nition of similarity when two events 
occur, this response is itself a rather 
trivial matter (it may be made with the 
big toe as well as with the vocal cords, 
and need not be of fixed character), and 
its being made depends on the prior 
learning of a rule for categorizing events 
as equivalent. This rule specifies what 
property (attribute-value) or properties 
must be conjointly found in order to as- 
sign events to the same class, and also 
tells us the nature of the contrast be- 
ing made—the relevant alternative class 
whose members are to be judged non- 
exemplars of the class of interest. We 
construct this contrast class in such 
ways as the following. (a) By impli- 
cation, from the degree of complexity 
of the class of interest; since the more 
complex that class is, the more likely 
we are to consider small deviations from 
it as the only relevant alternative class 
or classes, and to totally neglect large 
deviations.* Thus, for instance, we are 
likely to contrast cocker spaniels with 
setters or poodles and not with all other 
animals, whereas “all other animals” 
would be a relevant contrast class if 
we were interested in dogs rather than 
cocker spaniels. ( b) By experience with 
all the instances, positive and negative, 
to which we are exposed, for these es- 
tablish the domain of material relevant 
to us. Construction of a contrast class 
in turn suggests which properties we 
may best use for discriminating positive 
from relevant negative instances. For 
example, while the attribute of size will 
not let us discriminate dogs from all 
other animals, it does suffice to separate 
setters from cocker spaniels. 

The classification rule that we learn, 
then, has two aspects: it tells us on 


3 See Bruner, Goodnow, and Austin (2), P- 
38; and Kelley (13), pp. 303-305. 

4The “implication” in question here js not 
one of logical necessity, but rather a strategy 


that people tend to use—and indeed a valu- 
able one—for determining the level of classifi- 
e in particular 


category discriminations to mak 
situations. 
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what basis we are to group two events 
in one class and also tells us the range 
of other events with which we are con- 
trasting positive instances of the class of 
interest—it tells us what negative in- 
stances are relevant. The rule serves 
to delimit a particular domain of events, 
and to draw a contrast between two 
subsets within that domain: positive 
and negative instances of the class in 
question. This rule stipulates that cer- 
tain properties must be found com- 
present, i.e., together, in order to place 
an instance in a given class. Such a 
rule for class-inclusion becomes harder 
to attain, the greater the number of 
properties which the rule requires to be 
compresent before inclusion in the given 
class is warranted. This is the case be- 
cause the larger the number of proper- 
ties whose compresence is required in 
this way for class-inclusion, the greater 
is the number of properties which are 
found compresent in at least some of 
the relevant negative instances as well 
as in all positive instances of the class 
in question, thus making it more diffi- 
cult to locate the difference between 
positive and negative instances. 

In brief, then, this fourth proposal 
suggests that recognition of similarity 
depends on applying a rule which leads 
one to assign items to a common cate- 
gory. There is much more to similarity 
than the response of making this as- 
signment. For prior to that, one must 
learn the rule which guides the re- 
sponse." 

Each of these ways of defining psy- 


5 Jf one prefers to argue that such a rule is 
simply a complex response, it is incumbent 
upon one to specify the responses constitut- 
ing that rule. The suggestion often made at 
this juncture is that the rule is no more than 
the verbal responses which express it. But in 
animal learning studies discussed later—where 
one cannot, therefore, say the rule is nothing 
more than verbal responses—we shall note 
that the common responses which are present 
do not in fact constitute the basis of the rule 


that is learned. 
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chological similarity has much to be 
said in its favor, but no one of them 
seems able to account for all the evi- 
dence. Our aim in the following pages 
is to suggest some possible limitations 
which various of these views face, and 
to sketch the possible fruitfulness of 
exploring more fully the one of these 
approaches that defines psychological 
similarity in terms of assigning events 
to a common category. 

If, as we have suggested, diverse 
areas of psychological research all con- 
cern psychological similarity, one is led 
to ask whether these areas may be using 
the same kind of procedure, since a 
common concern sometimes intimates a 
common procedure of inquiry. This 
seems indeed to be the case here, and 
the procedure in question is that of 
transfer—the influence of experience in 
one situation on the handling of a sub- 
sequent situation. Insofar as something 
learned in a first situation will aid per- 
formance in a second one, then the sub- 
ject will show positive transfer if he 
recognizes the two situations to be simi- 
lar; while insofar as something learned 
in that first situation will hinder per- 
formance in the second, then he will 
show negative transfer if he recognizes 
the two situations as similar. Transfer- 
ability of learning thus provides a set 
of operations for determining whether 
recognition of similarity occurred. As 
we turn to various experiments now, 
in further considering the definitions of 
psychological similarity that we have 
reviewed, we shall find that all of them 
in effect use this transferability pro- 
cedure. 


COMMON ENVIRONMENTAL PROPERTIES 


Since this is, perhaps, the most fa- 
miliar and earliest view as to the nature 
of psychological similarity, we need not 
dwell on the many kinds of evidence 
from which it arose—for instance, mo- 
tor learning studies, of which the work 
by Lewis, McAllister, and Adams (20; 
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21) provides a recent example. Com- 
monality of the environmental proper- 
ties is established by the identity of 
measurement readings of the centime- 
ters-grams-seconds variety in the two 
situations being compared (call them 
Original Learning, OL, and Transfer 
Learning, TL). The evidence indicates 
that frequently, to be sure, transfer- 
ability of learning, and hence psycho- 
logical similarity, is related to presence 
of common environmental properties in 
these two situations. But there are, on 
the other hand, cases where presenta- 
tion of common properties in OL and 
TL does not result in psychological 
similarity of the two and hence in 
greater transfer. 

Consider, for instance, a study by 
Schwarz (27). In the OL task, the 
subject learned to attach a certain re- 
Ponse to a certain stimulus. In the 
an task, he had to learn to link a dif- 
erent response to that old stimulus. If 
transfer here depends on common en- 
vironmental properties, one would ex- 
i associative interference—that is, 
a old stimulus should tend to elicit 
ie old response; it should be trans- 

tred to the TL task despite its inap- 
oria teness there. Actually, such as- 
pote interference occurred only if 
io subject was forced to pay attention 
pe cmething other than this particular 
the a allowed to attend to this task, 
Pi 1 response was not transferred de- 
Bike identity of stimulus. When the 
ee was “paying attention” he was 
ent] Ping environmental events differ- 
B than when he was “not paying at- 
“aa _ When not paying attention, 

À us identity was allowed to become 
it “peal basis for classification, 
ieee en paying attention, the tempora 
EAS between “stimulus in the 9% 
He ene ‘stimulus in the TL task 
assign R in directing the subject t° 
ent cy he OL and TL stimuli to differ- 
tinctig oe based on this temporal dis- 

» even though they were physt- 


cally identical. Assigning them to differ- 
ent classes meant that negative transfer 
was avoided. 

Or consider a study by Ellis (4) in 
which college students learned finger 
mazes. Three different mazes were used 
in all, Maze 3 consisting of Mazes 1 
and 2 linked together. One group 
learned Mazes 1 and 3, another learned 
Mazes 2 and 3, and a third learned 
Maze 3 alone. Whereas an identical- 
stimulus-elements approach would pre- 
dict marked positive transfer for the 
first two groups in their learning of 
Maze 3, in actuality those two groups 
were not significantly better in their 
learning of Maze 3 than the group 
learning that maze de novo. Objec- 
tively identical stimulus elements did 
not per se lead to recognition of simi- 
larity. 

Common environmental attributes, 
then, do not seem to be an infallible 
indicator of psychological similarity. 
Such similarity may, under these con- 
ditions, be present for the person—but 
then again it may not. 


Common RESPONSES 


It was perhaps the Kantian revolu- 
tion in philosophy that first called into 
on Hume’s conception of the or- 
ganism. as a mirror of environmental 
stimulation, and suggested instead that 
the organism is a filterer and arranger 
of environmental events. The “mental- 
ism” of Kant’s approach banned it from 


having dir i 


questi 


ect repercussions in Ameri- 
can psychology until it was realized 
that considering the organism an active 
mediator does not require descent into 
its depths, but rather can be achieved 
by noting how the organism responds to 
jts environment. One need not go be- 
yond the relation between environmen- 
tal stimuli and a person’s responses. If, 
according to this view, & person was 
found to respond the same way to 
two objectively different situations, then 
those two situations were psychologi- 
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cally similar for him. And if, on the 
other hand, a person responded differ- 
ently despite repetition of the same 
situation (common environmental prop- 
erties), then no such psychological simi- 
larity obtained. Such an approach con- 
stituted a tremendous advance. The 
Humian assumption of man as ca- 
pable merely of similacral representa- 
tions of his environment was overcome, 
and in its place the capacities of the 
organism to select, ignore, and empha- 
size environmental attributes were rec- 
ognized. The exciting effects of this 
idea can be seen, for example, in the 
early work by Shipley (28, 29). In one 
experiment, linking an eyewink response 
to both light and shock, where shock 
elicited finger withdrawal, resulted in 
light becoming adequate to elicit finger 
withdrawal too. The common response 
of winking made to both light and shock 
indicated their psychological similarity 
for the subject. Whereas we would say 
this response was a signal of this simi- 
larity, proponents of the common re- 
sponse view would hold that this re- 
sponse itself constitutes that similarity. 
The implications of this kind of ap- 
proach are ably developed by Osgood 
(25), among others, and its applicabil- 
ity to many questions of psychological 
similarity is clear. 

The common-response view leaves us, 
however, with some uneasiness. For one 
thing, those who maintain it sometimes 
seem to rely on the magical efficacy of 
the term response to provide an aura 
of explanation where none really exists. 
The premise of the approach is that 
transfer will occur, and hence two 
events will seem similar, to the extent 
to which the organism makes the same 
response to both. But from here on 
the going becomes less simple. Para- 

doxically enough, the insufficiency of 
this premise seems to be tacitly recog- 
nized, for the full view consists of an 
attempt to defend the premise’s lacks. 
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Since the hypothesized common re- 
sponse that supposedly mediates the 
transfer sometimes cannot be located 
even though transfer is found, one 
meets the rationalization that some- 
times the common response linked to 
disparate stimuli is not observable, but 
in all such cases these unobservable re- 
sponses are supposed to have developed 
as fractional anticipations of observ- 
able ones. This is an empty-organism 
theory with a new twist. The organism 
gets filled eventually, but only with 
fractions of things that were overt re- 
sponses once. 

It seems relevant to remember that 
much of the impetus behind this view 
derives not from considerations of sci- 
entific adequacy as such, but rather 
from the cultural milieu of an earlier 
behaviorism in which the model of the 
organism as capable of transformed rep- 
resentations of its environment was still 
held in disrepute, as “mentalistic.” The 
common response theory hence sought 
to make this model acceptable to the 
scientific community by framing it in 
the language of responses. This end 
has been accomplished long since, how- 
ever, so it would perhaps be advisable 
now to restrict more carefully use of the 
term “response” to its usual meaning: 
muscular and glandular reactions. We 
would note that the restraints imposed 
by the nature of muscles and glands 
imply that behavior must be more lim- 
ited and have fewer dimensions about it 
than what goes on inside the organism.* 
Why, then, limit on a priori grounds 
one’s conception of what goes on in the 
organism to fractions of behavior? It 
does not make constructs more scientific 


®One way to indicate this difference be- 
tween behavior and what goes on inside the 
organism, is in combinatorial terms; there are 
more elements that can be combined, and 
hence a greater variety of possible outcomes; 
at the neuron level, than at the level © 
muscles and glands. 
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to frame them in the same terms as ob- 
servables. Rather, the reason we pos- 
tulate constructs is that the relations 
among observables are extremely com- 
plex, and it often proves necessary to 
invest our constructs with properties 
different from those of behavior in or- 
der the better to predict that very be- 
havior. The interests of explanation 
are not served simply by filling the 
organism with analogues of behavior. 
Hospers (9) and other philosophers have 
well pointed out that such reduction to 
the familiar is not what we really mean 
by scientific explanation. 

But were our uneasiness over the 
Common-response approach to stem only 
from general considerations such as 
these, it might well be dismissed. Dis- 
Ten becomes more difficult, however, 
a one notes the gradual accumula- 
tion of experimental evidence that ques- 
ie S the sufficiency of response identity 
“a mediating transfer of learning. Con- 
Ree? for instance, a recent study by 
a Mandler, O’Dowd, and Wal- 
cs Rats were overtrained to run 

t-right-left-right at successive choice 
ay of a linear maze, and then were 
with the task of learning to run 

“4 ht-left-right-left in the same maze. 
en object was to find whether there 

on Positive transfer from learning the 
tei urn sequence to learning its mut 
mage reversal. 
ts to the response-mediation 
i ed transfer is said to depend on 
i ne of response, and the greatest 
ity of response between OL an 
ing here is the single-alternation weav- 
pa whereby the rat alternates 
choice one side to the other at successive 
Gh o points. The faster a rat runs 

a Start box to goal box, the more of 

aving motion he learns. Insofar; 


7 
B 
D, Pae J. S., Mandler, Jean M., O'Dowd, 
mg and allach, M. A. “The role of overlearn- 
e Pabien level in reversal learning.” 1° 
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then, as transfer depends on the identity 
of a weaving-alternation response in OL 
and TL, rats running faster should show 
more positive transfer than rats running 
more slowly. The results do not sup- 
port this prediction. There was no cor- 
relation between speed of running dur- 
ing OL and degree of positive transfer 
on TL, within any of the groups run. 
Furthermore, considering differences in 
running speed between groups—differ- 
ences that were created by varying hun- 
ger conditions—groups whose drive con- 
ditions resulted in their running slowly 
during either OL or TL or both showed 
positive transfer, while the group whose 
drive conditions resulted in its running 
fast during both OL and TL showed no 
transfer. Apparently, therefore, some- 
thing other than a common response of 
weaving single alternation causes the 
transfer in the present experiment. 

Or take North’s finding (23, 24) that 
the performance of rats on discrimina- 
tio& reversals was no better when the 
rat was made to correct overtly his mis-, 
takes than when mistakes were left un- 
corrected. It was expected that re- 
versal learning would be better in the 
former condition, since it permitted a 
correct response to occur after a wrong 
choice, and this correct response was 
supposed to become associated to the 
present maze stimuli, and hence be 
evoked by these stimuli on the next 
trial, thereby mediating a full-fledged 
overt correct response. Identity of re- 
sponse thus was expected to mediate 
reversal learning. Much positive trans- 
fer in the learning of successive re- 
versals occurred, but it was not due to 
response mediation. It occurred, rather, 
only when there was a sufficient amount 
of overtraining on each reversal—a re- 
sult also obtained in the study on single 
alternation cited above. Apparently, for 
the animal to classify initial and later 
situations in terms of the more abstract 
property which they in fact have in 
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common, he must be given enough prac- 
tice on the former situation so that its 
more concrete properties no longer re- 
quire all his attention. The abstract 
property in question was single alter- 
nation in the case of the first study 
noted; reversal of position response on 
encountering an error, in the case of the 
second study. The concrete properties 
involved were particular sorts of muscu- 
lar kinesthesis that occur in relation to 
particular olfactory and visual cues 
from various parts of the maze. 

To be sure, insofar as one broadens 
the definition of “response” to include 
anything going on inside the organism, 
as some theorists such as Osgood (25) 
and Mandler (22) advise, evidence of 
the above sorts does not run counter to 
the common-response view. But broad- 
ening the definition in this way would 
seem more to cloud than to clarify the 
issues. There are grounds, then, for 
suggesting that presence or absence of 
common responses is not an infallible 
indication of presence or absence of 
psychological similarity, although there 
are certainly cases where such responses 
do serve in this manner. 


PRIMARY STIMULATION GRADIENTS 


Many studies have been done in sup- 
port of a view of psychological similar- 
ity which may be stated somewhat as 

- follows. When a stimulus impinges on 
a receptor and becomes associated to a 
response, a neural gradient is laid down 
such that other stimuli will also elicit 
this response but to lesser degree, re- 
sponse elicitability decreasing with in- 
creasing distance of the new stimulus 
from the training stimulus along some 
dimension of stimulus differences—for 
instance, intensity or pitch of tones. 
The dimension of primary stimulus gen- 
eralization need not be innate, as was 
umed by those proponents of this ap- 
ase most directly in the 
proach who were mo Il (10): but 
Pavlovian tradition, like Hull (10); bu 


rather may be the product of learned 
differentiation (i.e., discrimination train- 
ing)—a coming into closer contact with 
stimulus differences—as in the manner 
Suggested by Gibson and Gibson (5). 
But however the dimensions are thought 
to arise, the position in question as- 
sumes the shape of the gradient laid 
down upon them to fall off on either 
side of the point of stimulation along 
these dimensions; and, if training and 
test stimuli are presented at discrimin- 
ably separate points along a dimension 
which the individua] has learned to dis- 
criminate, or can innately discriminate, 
then generalization is expected to occur 
along that dimension. 

These ideas of a normal distribution 
of intensity of excit 
of stimulation, and 


ilarity may be medi- 
at process but rather by 
the class to which the Subject assigns 


various! inputs... ‘The Magnitude of the 
Subject’s response to a test stimulus can 
depend, one ma 
not he Places both training and test 
stimuli in a common class. There may 
Well be individual differences as to 


Y suggest, on whether or - 
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whether these two stimuli are placed in 
the same category or not; and in ad- 
dition, experimental conditions may be 
set up in such a manner as to induce or 
discourage the placement of those two 
stimuli in the same class without vary- 
ing these stimulus presentations them- 
selves. If such diffuse factors as these 
were to cause differences in magnitude 
of response to the test stimulus—despite 
the facts that the dimension along which 
the training and test stimuli fall and 
the differences along that dimension are 
easily discriminable by the subject, and 
the training and test stimuli presented 
are the same for all these subjects— 
then the sufficiency of a stimulus-gen- 
eralization explanation for such data 
would be called into question. And, in 
fact, there do seem to be cases where 
differences of these kinds occur. 

We may note, for example, a study 
by Wickens, Schroder, and Snide (32), 
using the GSR. Adult subjects in one 
group received shock-reinforced presen- 
tations of a particular tone, and then 
were given extinction trials with one of 
three tones at varying frequency dis- 
tances from the training tone. Subjects 
in another group underwent the same 
regimen, except they heard a nonrein- 
forced click interspersed among the re- 
inforced tone presentations during train- 
ing. The group not hearing clicks— 
qua group—yielded a gradient of de- 
creasing GSR magnitude as distance 
between training and test stimuli in- 
creased; but the height and shape of 
the gradient differed on different ex- 
tinction trials, and only by averaging 
the results of large numbers of subjects 
(24 in each extinction subgroup) could 
one tease out even these not very great 
consistencies. The group hearing clicks 
showed no clear gradients at all, even 
as a group. 

Why the different results fi 
two conditions? Recall what we note 
earlier concerning the two aspects © 


rom these 
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tules for classing events as equivalent: 
that they specify the range of negative 
instances we are seeking to discrimi- 
nate, as well as the nature of positive 
exemplars of the class of interest. We 
would suggest that presence versus ab- 
sence of clicks during training resulted 
in the learning of different rules for the 
classification of test stimuli during ex- 
tinction. The group hearing clicks used 
the property “tones” as definitive for 
class-inclusion, and excluded clicks. All 
test tones were assigned to one class. 
Because of what they experienced dur- 
ing the training series, the domain of 
instances for this group consisted of 
“tones” and “clicks.” The group not 
hearing clicks, on the other hand, used 
the properties “tones, and within such- 
and-such range of pitch” as definitive 
for class-inclusion, and excluded tones 
of other pitch—these latter being the 
relevant negative exemplars for them. 
Only some test tones met the specifica- 
tions for this class. Because, then, this 
group experienced only the repeated 
tone during training—i.e., heard no 
clicks—this group’s domain of instances 
consisted of “tones of various pitches.” 
Hence, we would expect the GSR to 
the test tones far from the training tone 
to be much Jess than that to the test 
tone closest to the training tone for the 
group not hearing clicks—and this was 
in fact the case. These subjects as- 
signed only the nearest test tones to the 
same class as the training tone, further- 
away test tones being excluded. But 
we would expect no clear trends of dif- 
ferences in GSR among the test tones 
for the group hearing clicks—and this 
also was the case. These subjects as- 
signed all test tones to the same class as 
the training tone, clicks being the items 
which were excluded from the class. 
The difference between the click and 
no-click conditions, then, seems to be 
that the subject was induced to classify 
the tones differently in each case. 
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It may be that many other “generali- 
zation” experiments also involve a cate- 
gorization process like that we postulate 
for the “no-click” group in the above 
study: Test stimuli close to the train- 
ing stimulus are placed in one class with 
the latter, those further from the train- 
ing stimulus being excluded. The widths 
of the class in question vary across in- 
dividuals, and hence one finds a smooth 
normal-curve type of gradient only when 
one averages all subjects together. But 
there are further individual differences 
in classification, and, what is more, 
classifications may change as a test se- 
ries proceeds. Hence one finds, for in- 
stance, nonconsistent results over ex- 
tinction trials taking groups of subjects 
as wholes—as in a study by Grant and 
Schiller (7). That these effects are due 
to categorization differences seems a rea: 
sonable hypothesis when one considers 
work like that by Wickens et al., where, 
on our interpretation, the probabilities 
of alternative rules for classification 
were varied, and this was found to cause 
differences in magnitude of Tesponse to 
a test stimulus. 

Although primary stimulation gradi- 
ents seem able to account for some Cases 
of psychological similarity, then there 
are others in which a different process 
may well be at work. 


ASSIGNMENT To A COMMON CATEGORY 


Each of the three definitions of psy- 
chological similarity noted above has 
been found to have only limited appli- 
cability. Such is also the case with 
this fourth definition. Since, however 
it is younger than the others, its im- 
plications have not yet been thoroughly 
explored; and since it seems relevant in 
many cases where the other definitions 
cannot easily be applied, it seems ad- 
visable to examine it further with re- 
gard to its range of utility. 

According to this fourth definition 
when we have attained a rule which 


bisects a domain of events into a set 


that exemplifies a certain property or 
conjUhction of properties and a particu- 
lar COhtrasting set that does not, then 
we will transfer behavior from one to 
another of two events that fall into the 
former set, but will tend not to transfer 
behavior; from either of them to an 
event im the latter set. Again, then, 
the relation has two aspects: It tells us 
the rangis of negative instances which 
are relevaiin > -nr m- poses ef discrimi- 
nation, as well as the basis on waich 
we are to declare events Positive ex- 
emplars of the class of interest. In 
short, recognition of the similarity of 
two events depends on their being 
classed as equivalent—as exemplifying 
the Property or conjunction of proper- 
ties in question, and as standing in con- 
trast to a particular Tange of nonex- 
emplars.® _ We learn to take one or 
more particular properties as criterial 
tor sorting a domain of events. 

There Would seem to be two main 
determinants of the kinds of classifica- 
tion rules we attain: the nature of the 
instances presented (i.e. 
mental properties to whi 
been exposed); and our pre 
some bases of classificatio: 
dices against others, 
nature of the instance 
tion Preferences henc 
psychological similar 
ways.’ Let us see 
seems to contribute 
sidering two Psychol 


the environ- 
ch we have 
dilections for 
n and preju- 
Varying either the 
s or our classifica- 
e should influence 
ity in predictable 
if this approach 
anything in con- 
ogical issues: the 


(6) and Raia a to those of Goodman 


in Kliiver’s early a first approached 


pare this fourth nt Paper's aim is to com- 
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first, transposition—a matter around 
which much empirical study has cen- 
tered; and the second, distinctive simi- 
larity—KGhler’s demonstration of the 
importance of similarity in guiding at- 
tempts at more complex human prob- 
lem-solving. 


Transposition 


Some years ago, Kohler (17, pp. 217- 
227) and Koffka (15) argued that dis- 
criminating in terms of such relations 
as “brighter than” or “larger than” is 
more primitive, more basic, than dis- 
criminating in terms of absolute sen- 
sory qualities such as brightnesses and 
sizes. Spence (30) argued just as in- 
sistently, however, that responding on 
the basis of absolute stimulus qualities 
such as brightness or size is more primi- 
tive, and relational responding is a re- 
sult of the algebraic summation of re- 
sponse tendencies to absolute stimuli. 
But one may suggest a third alternative. 
Perhaps higher organisms (and certainly 
humans) can learn to respond either in 
terms of relations between stimuli or in 
terms of absolute stimulus qualities, and 
which selection they will make depends 
largely on the conditions instituted con- 
cerning classification preferences and the 
nature of the instances. 

Consider, for example, the nature of 
the instances. We would predict a 
greater tendency for subjects to re- 
spond relationally on a transfer test, 
when trained on couplets of one large 
and one small form apiece, and when 
the particular forms used vary in size 
(but not shape) across couplets, than 
if the same amount of training were 
given on one repeated couplet of a 
large and small form; in both cases re- 
sponse to the larger form is always Te- 
warded during training. In the former 
condition, the nature of the instances 1S 
such as to permit the subject to rule 
out the possibility of classifying the 
data in terms of the absolute stimulus 


quality of size, and the only class into 
which all data fit is one defined by the 
relational property of “larger than”; 
whereas in the latter condition, the 
kinds of instances presented permit clas- 
sification on the basis of absolute size 
or relative size. We could just as well, 
of course, arrange our instances so as to 
rule out the possibility of classifying on 
the basis of relative size, simply by al- 
ways rewarding choice of the same par- 
ticular stimulus form, and having it ap- 
pear sometimes paired with a larger 
form, sometimes with a smaller form. 
In a like manner, whether the subject 
will classify on an absolute, relational, 
or even some other basis depends on his 
biases for or against particular bases of 
classification, as induced by instructions 
or other means. Given variation in the 
nature of the instances and in such 
biases, it would seem one can create 
conditions that will induce the subject 
to classify the training instances, and 
hence to transfer, on any basis we de- 
sire. 

Thus, work with children by Jackson 
and Jerome (12), for instance, indi- 
cates that how the child categorizes the 
training instances depends on whether 
he is asked to decide which stimulus is 
correct after the presentation of both, 
or rather is required to indicate, after 
each stimulus is presented, whether it is 
the correct or incorrect stimulus. As 
we would expect from the differing em- 
phases on relational versus absolute clas- 
sifying implied by these different condi- 
tions of instance presentation, relational 
choices were preponderant when the 
child gave one over-all response after 
simultaneous or successive presentation 
of both stimuli, whereas absolute choices 
were in the majority when the child 
had to respond to one stimulus before 
the second was presented. Requiring 
the subject to make a separate response 
to each stimulus favors classifying on 
an absolute basis, whereas requiring the 


114 Micuarr A, 
subject to make a single response to 
both stimuli as a pair favors classifying 
on a relational basis. 

Stevenson, Iscoe, and McConnell (31) 
found that college students varied in the 
basis they used for responding on the 
first transposition test trial; many, to 
be sure, responded on a relational basis, 
but some responded on an absolute ba- 
sis. Apparently, different subjects re- 
ceived different impressions about what 
was required of them, and hence classi- 
fied the data presented to them in dif- 
ferent ways. What seem needed in this 
area, then, are studies analyzing the 
conditions of instance presentation and 
biases concerning particular attributes 
that influence the basis the subject 
chooses for classifying the training 
stimuli—conditions which therefore in- 
fluence the kind of transposition shown, 


Distinctive Similarity 


Köhler (16) uses the term 
tive similarity” for denoting t 
ticular items resemble each ot 
than they resemble th 
tion. The nature of this g 
lesser resemblance js itself 
left unanalyzed, alth 4 
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ticular, was greater for the latter group 
than for the former group; in one case 
it was “distinctive,” in the other case it 
Was not. But of what does this psycho- 
logical similarity consist? 

One may suggest that analysis of the 
Classification tasks facing the subjects in 
Kohler and von Restorff’s groups casts 
light on something which these authors 
really left shrouded in mystery—the na- 
ture of distinctive similarity. After be- 
ing shown the application of the prin- 
ciple to the initial arithmetic problem, 
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was minimal—no More than that they 
Problems,” 


junction of 
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Properties was required to 
em a positive exemplar of 
b . 1 question: the item had to 
e an arithmetic Problem and also had 
fo exhibit certain additional features: 
For this group there are more proper- 
ties compresent for positive and rele- 
vant negative instances (e.g, that the 
task be not only a problem but also 
Tn Grithmetic problem is compresent for 
oth) than for the other group. In me 
wTSt case, the domain of relevant ee 
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lems,” and the contrast is between these 
two types. In the second case, the do- 
main is “arithmetic problems,” and the 
contrast is between arithmetic problems 
with certain features and arithmetic 
problems with other features. Again, 
the nature of the instances presented 
to the subject determines the kind of 
classification attempted. In short, “dis- 
tinctive similarity” of two events means 
that the rule for assigning them to 
the same class provides for a minimum 
number of properties to be found com- 
present in at least some of the relevant 
negative instances, as well as all the 
positive instances of that class. The 
difference between positive and relevant 
negative exemplars is easier to locate. 
Could the rule for the group that does 
poorly be rendered easier by a change in 
biases of the subjects toward particular 
attributes? Let us consider the follow- 
ing variation on the Köhler and von 
Restorff experiment. Suppose we give 
another group all mathematics prob- 
lems and in the same order as the 
original group, but tell them, after the 
shortcut has been explained, that not 
all the subsequent problems are ame- 
nable to the shortcut, but that some 
will be. We have thereby set these sub- 
jects to remain vigilant throughout the 
series for the features of later problems 
that are the same as those in the initial 
one—the features that hence permit the 
principle to be applied—and would ex- 
. pect them to apply the shortcut to the 
last problem more often than did the 
group not given this set. Without 
changing the problems, then, we woul 
expect the percentage of shortcut solu- 
tions to increase because of this change 
in set; these subjects have been more 
sharply tuned to try to find what fea- 
tures of arithmetic problems permit ap- 
plication of the shortcut. They have 
been encouraged to try to take a pai- 
ticular conjunction of properties as cri- 
terial for sorting events. 
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If one wishes to study distinctive 
similarity in the transfer of principles 
for problem-solving, then, the recom- 
mendation would seem to be this. Let 
us determine the conditions of instance 
presentation and attribute biases that 
aid or hinder the arrival of the subjects 
at rules for classifying tasks in terms of 
properties which they potentially have 
in common—conditions which aid or 
hinder, in other words, a match be- 
tween psychological similarity and po- 
tential similarity. 


CoNCLUSION 


The aim of this discussion has been 
twofold. First, after pointing out the 
ubiquitousness of the issue of psycho- 
logical similarity in various areas of 
research on cognition, to note the way 
in which a common procedure, that of 
the transfer experiment, tends to unite 
them, and to urge that they be con- 
ceptualized in an integrated manner. 
Second, to consider the values and diffi- 
culties of four conceptions of psycho- 
logical similarity, defined in terms of 
common environmental properties, com- 
mon responses, primary stimulation gra- 
dients, and assignment to a common 
category. All four definitions are, to be 
sure, of limited applicability—but the 
last of them appears to apply in many 
instances where the first three do not. 
Several experimental implications of de- 
fining psychological similarity in terms 
of assignment to a common category 
were explored, and were taken to indi- 
cate that further work with this ap- 
proach may well prove fruitful. 
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CONFORMITY, STATUS, AND IDIOSYNCRASY CREDIT: 


E. P. HOLLANDER * 


Carnegie Institute of Technology 


Something of a paradox exists in the 
prevailing treatments of conformity and 
status. Students of social psychology 


are likely to be left with the pat im- 
pression that the freely chosen leader 
conforms to, and perhaps tenaciously 
upholds, the norms of his group. Yet 
this kind of leadership is also presented 
as a status sufficient to provide latitude 
for directing and altering group norms 
(11, p. 416). From their recent experi- 
mental work in this area, Dittes and 
Kelley have voiced a doubt that the re- 
lationship between conformity and status 


is ever a simple one (7, p. 106). The 


evidence favors their assertion. 
Although these phenomena may be 
treated as discrete entities, they both 
arise from interaction between an indi- 
vidual and a set of relevant other indi- 
viduals constituting a group. To say 
that an individual conforms, or that 
he has status, is not to say that these 
are independently determined states nor 


that they are terminal; they have some 


1A paper derived from this material was 
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are due to the overseers of the Group Psy- 
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common origin in a phenomenal rela- 
tionship which persists over time. Con- 
formity and status may be thought of 
therefore as mutually dependent, and 
transitionally effective upon subsequent 
interactions. With this as a frame- 
work, several general conceptions will 
be expressed here regarding mechanisms 
which produce these phenomena and 
govern their relationship to one another. 
In a gross way, three classes of vari- 
ables, or elements, are necessary to this 
conceptual scheme: characteristics of 
the individual himself; characteristics 
of the group with which he interacts; 
and outcomes of interaction represent- 
ing a past history which may alter the 
relationship of the former elements. 
Of particular importance as a medi- 
ating process is the changing perception 
brought about in the individual and the 
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group by their interaction; the third ele- 
ment is, in effect, this process. A dis- 
tinction is required, therefore, between 
the phenomenal and perceptual features 
of behavior. An individual’s behavior 
is not only phenomenally present in in- 
teraction but is also subject to view and 
appraisal by the other members of the 
group. If there are to be consequences 
involving these others, it is essential 
that there be a perceptual intake on 
their part. And so too must the indi- 
vidual perceive a group norm; the fact 
that it is manifestly there is not enough, 

It is worth emphasizing that the focus 
here is upon how the individual fares in 
the group rather than upon more global 
consequences to the group. Two kinds 
of interlocking mechanisms are of con- 
cern: those giving rise to behavior in 
conformity with group demands, and 
those giving rise to status. ‘The issues 
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at stake may be put simply as follows: 
What produces conformity? And what 
allows for nonconformity? 


SOME QUESTIONS ON CONFORMITY 


Fundamental to these issues is the 
matter of determining when an individ- 
ual may be said to be conforming. One 
may note that a twofold assumption un- 
derpins the usual view of conformity, 
ie., that the individual is aware of the 
existence of a given group norm, and 
that his behavior in accordance with 
this norm is evidence of conformity. It 
is doubtful that both features of this 
assumption necessarily hold simultane- 
ously. This being so, difñculties of in- 
terpretation will arise. If the individual 
were to be insensitive to the norm he 
could hardly be said to be conform- 
ing to it, whatever his behavior seemed 
to betray; correspondingly, a kind of 
“conformity” might prevail in terms of 
adherence to an incorrectly perceived 
norm; and thus, an evident failure to 
Conform might or might not be “non- 
conformity” depending upon the accu- 

Tacy of the individual’s Perception of 

the norm in the first place, 


y or nonconformity 
ble, once the pres- 


uate recognition of the 
nort is detailed, 


There remains too th 
who perceives a given behavior to he 
conforming, i.e., an external observer, a 
group member, or the actor himself, 
Employing a fixed-norm baseline for ob- 
servation, as is often done, serves to 
obscure differential expectations which 
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render conforming behavior for one in- 
dividual nonconforming for another— 
with regard, that is, to others’ percep- 
tions in situ. Thus, the degree of fa- 
miliarity with the unique properties of 
the group context is critical in verifying 
and understanding conformity, 


Norms, ROLES, AND Group 
EXPECTANCIES 


and defined as a 
norm. Probably because many studies 
of groups have involved highly manifest 
behaviors, norms are conceived to be 
quite literally evident, On the other 
hand, in the related concept of role a 
recognition exists that the behavioral 
standard may not be manifest, but 
rather may be an expectancy,’ 


Though persisting, the distinction be- 
tween norms and roles is neither essen- 
to maintain (cf, 2, 16). 
Roles are normative in that they in- 


ed expectancy 


among group members; and norms 


themselves, lacking visibility, may none- 


theless dwell in expectancies, It is 
these expectancj 


group. 


Objective observers might delimi 
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individuals as such. For the individual 
in the setting, however, manifest con- 
formity probably comes about without 
regard to a separate awareness of norms 
as distinct from roles, but more likely 
in terms of behaviors which he per- 
ceives to be expected of him by rele- 
vant others, i.e., “doing the right thing.” 

In the world of daily interaction, the 
perception an individual holds of what 
relevant others expect of him is a sin- 
gularly important determinant of his 
Social behavior; and the degree to which 
an individual perceives the group to be 
rewarding serves to enhance or elabo- 
rate the effect produced by his motiva- 
tion to belong. An alternative sequence 
May be seen to occur as well: motiva- 
tion having reference to some fulfillment 
through the group serves to heighten 
the individual's perception of its ex- 
Pectancies,* 


INDIVIDUAL VARIABLES 


Granted that conformity derives from 
certain features of individual perception 
and motivation, it still remains neces- 
Sary to identify these features more 
pointedly. In this formulation, there 
are four such to be noted: perceptual 
ability (Pa), representing a general 
alertness to the social stimulus field; 
perceptual error (P,), with particular 
reference to group expectancies; mo- 
tivation to gain or sustain social ap- 
Proval (Mą); and, motivation to take 
Part in the focal activities of the group 
(M,). Taken together, the latter two 
variables may be considered as the in- 
eee motivation to belong to the 
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ie 1 variables can feadily 
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be related to personality typologies. 


*The work on selective perception (ef, 
18) sustains some such formulation, in gen- 
eral, But the linkage between motivation an 
Perception has considerably greater comp ex- 
ity, as Bruner (5) has more recently pointed 
out, 
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Many of these, e.g., authoritarianism, 
rigidity, or empathy, appear to lend 
themselves to a reduction to perceptual 
function as a core element (cf. 1, 3, 
19). Terms like “perceptual rigidity,” 
“perceptual defense,” and “social im- 
perceptiveness,” often appear as con- 
comitants of these broader characteriza- 
tions; evidently, this element accounts 
for certain diversities in behavior which 
distinguish individuals from one an- 
other, 

It is useful here, however, to recog- 
nize a differential between that which is 
given and that which is emergent, i.e., 
perceptual ability and perceptual error, 
though the interaction of the two is not 
challenged. The distinction basically is 
that the former serves as a parameter 
setting the lower limit on the latter. 
Thus, the minimum level of an indi- 
vidual’s P, is set by his basic capacity, 
P,. This should not be taken as neg- 
lect of the potentials of learning, how- 
ever. The concept of capacity intro 
duced here may be understood to be 
similar to that of cognitive structures 
(cf. 15). No assumptions are made 
about the source of the “capacity”; it 
is only significant as a feature of the 
individual which bears upon interaction. 
It seems reasonable to believe that some 
individuals have an initial advantage 
over others as regards accuracy in per- 
ceiving group expectancies. 

Concerning motivation to belong, men- 
tion has already been made that it in- 
volves two continua: motivation spe- 
cific to the activity—or instrumental 
features—of the group, M,; and mo- 
tivation rooted in a generalized need for 
social approval, Mf, This view cuts 
across 4 number of other motivational 
schema suggested elsewhere (ef. 4, 8, 
9, 12, 23), and is intended more as a 
resolution than a departure, Briefly, 
these other distinctions appear to in- 
volve an “activity focus” and an “other 
people focus.” Activity involves others, 
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of course, but not necessarily to gain 
their approval. What really seems to 
matter is the nature of the reward 
sought. p 

The approval variable might be viewed 
as a parameter of personality, but not 
one so static as to be unaffected by in- 
teraction, within certain limits, Since 
those members having interests which 
can only be satisfied through participa- 
tion in group activity do not of neces- 
sity have a high need for social ap- 
proval, and since those cast into groups 
of little positive activity valence to them 
may still require approval, it is possible 
that these variables may be related 
negatively or positively, depending upon 
the circumstances considered. 


STATUS EMERGENCE 


The foregoing points have concen- 
trated on individual characteristics that 
absorb and deal with features of the 
social context. Ultimately, these have 
consequences in behavior, which in its 
turn has an impact upon the group. It 
is appropriate now to consider the im- 
plications of this process to the emer- 
gence of status, 

At bottom, status ma; 
an outcome of the 
perception of the 


y be taken to be 
group’s differentiated 
individual, leading to 
d expectancies re- 
This occurs as a 
the behaviors or 


has special value as 
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status has no intrinsic value or mean- 
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fiot be divorced from its Perceptual 
locus; hehavior is only appropriate to 
status insofar as sotneone perceives ip 
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to be so. Perceptual differentiation by 
the group has consequences, then, in 
terms of the behaviors it expects the 
individual to display.’ i 

Though not necessarily the case, it is 
desirable to conceive of status within 
this framework as having hierarchical 
Properties on some sort of sroup-accept- 
ance continuum (cf. 7). This is by no 
means Critical as a feature, but is of 
heuristic value. Still further, it is con- 
venient to represent status as permit- 
ting greater latitude in the manifesta- 
tion of behaviors which would be seen 
to be nonconformist for the other mem- 
bers of the Sroup; we refer here to com- 
mon expectancies, a term introduced 
earlier. The implications of this aspect 


of status are of especial relevance to 
What follows, 


IDIOSYNCRASY CREDIT 


Status will hereafter be considered to 
be an outcome of interaction referred to 
as “idiosyncrasy credit” (C). 


and function, 
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It is noteworthy that this concept is 
applicable to the limited, artificially pro- 
duced laboratory group as well as to the 
total society. And, since the individual 
may have simultaneous membership in 
many groups, he may be considered to 
have a distinct credit balance in all 
groups with which he is in some sense 
involved; in each case he has achieved 
some level of status. Affixed to this con- 
cept of “credit” is the further considera- 
tion that “debits” of varying magni- 
tudes may be charged against the credit 
balance, depending upon the gravity 
and frequency of the idiosyncrasy mani- 
fested, and the credit level which the 
individual holds.° 

Taking our society today as an illus- 
tration, one’s credit balance very likely 
will be rapidly exhausted by publicly 
espousing Communist doctrine. In a 
different sphere, a fraternity man may 
experience comparable rejection by his 
peers for growing a beard, though other 
factors would come into play, so that 
for some individuals the consequences 
in terms of group sanctions—would be 
disastrous and for others hardly disturb- 
ing. This requires some consideration 
of factors which determine the awarding 
of credit. 

Among other determinants, the credit 
balance that a group member achieves 
depends upon the group, its function, 
and other properties to be considered 
below. It is useful for our purposes 
here to conceive of an “open system,” 
ie, an autonomous group providing 
focal activities, as well as free face-to- 
face interaction yielding expectancies; 
this would permit the simultaneous ob- 
servation of an individual’s behavior by 

6 Alterations upward or downward in credit 
may be conceived as a negative, monotonic 
function of credit balance. Thus, for the same 
idiosyneratie behavior or negative weight at- 
tached to value, tbe tndividual with high 
status loses less credit than the marginal i- 
dividual of low status (cf. 20). 
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all group members and the generation 
of impressions representing credit. 

There are three general variables 
which can be delineated as determi- 
nants of these impressions. The first 
of these is alpha value (V«), referring 
to the individual’s task competence or 
performance in regard to focal group 
activities; the second is beta value 
(Vp), referring to characteristics of the 
individual not specific to these activi- 
ties, eg, status in a broader group, 
bonhomie, and the like; the third is 
immediate past idiosyncratic behavior 
(B), constituting a drain on credits.” 
Tt is not contended that credit is neces- 
sarily related linearly to these variables, 
nor is their very likely interrelationship 
ignored. They are doubtless intercor- 
related, though of varying degrees of 
significance in generating or dissipating 
credits. As a generalization, value (V) 
tends to increase credit while idiosyn- 
cratic behavior (B) acts to decrease 
credit—though the potential for nega- 
tive value exists, eg, in the case of 
prejudice. 


GROUP VARIABLES 


From the foregoing it should be ap- 
parent that an individual can only be 
accurate in perceiving expectancies in- 
sofar as they are normative, in some 
modal sense, and are communicated. 
Two interrelated group variables which 
have importance in this regard are group 
attraction (4) and the communicality 
of any given expectancy (Y). 

The former vatiable may be thought 


7 Still another variable related to credit bal- 
ance, probably curvilinearly, would be the du- 
ration of the individual's affiliation with the 
group over time. This has been disregarded, 
since it is useful to deal with individuals as 
though they have been in the group for an 
equal period of time, more particularly from 
its inception. It is also likely that the degree 
to which the individual is “visible” may alter 


the effects produced by his value and idio- 
syneratic behavior. 
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of as “cohesiveness,” a term more usu- 
ally applied. But since this term may 
have at least several operational mean- 
ings, it is preferable to specify two kinds 
of literal attraction, or an aggregation 
of these: attraction to group members, 
and attraction to focal group activities, 
This duality follows the M a and M, dis- 
tinction made earlier in connection with 
motivation to belong. Although one may 
deal with a nondiscriminate aggregation 
of these, it is quite true that the sum of 
individual attractions in the group may 
be based predominately in M, needs or 
M, needs, and that differences in the 
emergent characteristics of groups thus 
constituted will be evident. Thus, where 
group attraction derives mainly from Ma 
one might predict it would be more 
stable than where its source is mainly 


M,, since the latter variable is more 
temporally based. 


literally 
nt, and 
elevance 


heightened social 


e shapin 
of &roup attitudes and hence my 8 
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most fruitful one for purposes of study. 
If one were disposed to test the ten- 
ability of the hypothesis that leaders 
have this greater social sensitivity, it 
would seem desirable not only to raise 
the question of whether leaders, and 
others, tend to use their own personal 
attitudes as an anchorage for estimat- 
ing group altitudes, but to ask in addi- 
tion Whether this exists independently of 
the degree to which given attitudes actu- 
ally are foci of communication within 
real groups. Another approach, accord- 
ingly, might be to utilize some index of 
the degree to which a specific attitude 
—or an expectancy—actually evidences 
itself in a given Sroup. There is utility, 
then, in introducing the operationaliz- 
able property “communicality,” 
Though a level of communicality may 
characterize a group, the particular cen- 
ter of interest here resides in a given ex- 
pectancy. One may venture in this vein 
that the communicality of an expect- 
lower relative leye] 
expectancies, if it is 
he group as a whole; 
cies ought to have 
ity than the differ- 
associated with in- 
Since communicality 
> it may be seen to 
too, that interpersonal interac. 
tion results in higher communicality, 
mechanism, inter- 


. OM may contribute to 
Stroup attraction (cf, 11, 17), 


SUMMARY OF VARIABLES 

For convenience 

scribed ma 
tionally, 


B—Ldiosyneratic behavior, i.e., any group 
by thee roup tie Which may be perceived 
E; , 2 fll } O by] , | Ju 
th ohh Ale from a given group 

C= Hos yncrasy credit, ie, the extent 
to Which a piy ti group member’s idiosyn- 


cratic behavior (B) is allowable, in terms 


» the variables de- 
Y now þe set forth defni- 
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of gravity and frequency, before group 
sanctions are applied. 

Ve—Alpha value, i.e., the weight as- 
signed the current performance of a given 
individual, which may be perceived by the 
group as bearing upon its focal activities, 


e.g., task competence, 
Va—Beta value, ie the weight assigned 


the characteristics of a given individual 
which may be perceived by the group, but 
are not specific to its current focal activi- 
ties, e.g., status external to the group. 

P,—tThe perceptual ability of a given in- 
dividual, in the sense of a capacity to per- 
ceive events and relationships in the social 
field. 

P,—The perceptual error of a given in- 
dividual in perceiving events or relation- 
ships in a particular social field, e.g., group 
expectancies. 

M a—The motivation of a given indi- 
vidual to affiliate with a given group, in 
terms of a gaining or sustaining social ap- 
proval. 

M,—The motivation of a given indi- 
vidual to affiliate with a group, in terms of 
interest in focal group activity. 

M—Individual motivation to gain or 
sustain membership, i.e., some composite 
of a given individual’s motivation of both 
the M, and M, variety. 

A—In general, attraction of the group 
to its members, i.e., some aggregate of 
group members’ M. 

V—The communicality of a given group 
expectancy, in terms of the degree to which 


à given expectancy is evident, 
SCHEMATIC REPRESENTATION 


In Fig. 1 our symbolic notation has 
been employed to represent relation- 
ships schematically. Since a sequential 
pattern is of particular importance, 4 
time dimension is involved throughout; 
thus, subscripts are introduced to indi- 
cate the time interval to which reference 
is made; €g, 41 is read as the first time 
interval; or, Pe, as perceptual error in 
the second time interval. ’ 

The system originates at the top WT 
group attraction as a motivational Cos 
text, and three individual variables, per- 
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Fic. 1. Schematic representation of mechanisms 
demonstrating relationships over time. 


ceptual ability, motivation to gain or 
sustain social approval, and motivation 
with reference to the group’s activity. 
At the next level, group attraction has 
given rise to the communicality of cer- 
tain expectancies which are then per- 
ceived by the individual, thus yielding 
a perceptual error; and motivation to 
belong has been aggregated at this level, 
as well. Beta value is also introduced 
to signify the group’s perception of the 
individual’s characteristics, e.g., pleas- 
ant appearance. 

Moving down in time, the individual's 
idiosyncratic behavior during the pê- 
riod just elapsed has been generated by 
nis error in perceiving expectancies to- 
gether with his motivation—within the 
constraints imposed by the level of 
group attraction. ‘The group’s percep- 


8 The term “group perception” refers to an 
abstraction; it is unlikely that all members of 
a group will perceive a given feature of an 
individual identically; the intent therefore is 
only to suggest a modal tendency, However, 
some very recent research has illuminated 
jüst this poiiit and given credence to the gen- 
aral pope jon of differentially-determined 
value (ef. j. 


124 . 


tion of the individual’s contribution to 
its focal activities, alpha value, is in- 
fluenced by the immediately prior per- 
ception of his characteristics, beta value. 

In the next stage, status is gener- 
ated—in the form of group-awarded 
credits—by the effects of behavior rela- 
tive to expectancies and the sequence of 
beta value to alpha value to credits. At 
this point the full set of interactions 
are in play, with credits affecting beta 
value; the latter serving as a repository 
of group perceptions of the individual’s 
characteristics; perceptual error and 
motivation are reintroduced for this 
new phase, with the former affected by 
communicality of expectancies. Idio- 
syncratic behavior is subsequently de- 
termined by available credits, as this 
is checked by motivation, in particular, 
and perceptual error, 


DISCUSSION AND IMPLICATIONS 


Early in interaction, conformity to 
group expectancies serves to maintain 
or increase status, particularly as it is 
seen to be combined with manifest con- 
tributions to the group; at a later phase, 
however, the status thu. 
mits greater latitude 
behavior, 


early in his ex- 
posure to the group and if he shows 

f competence, he ac- 
For evident deviations 
S, OF poor performance, 
» If he exhausts his 


letel 
ee = ee pletely, hte pon 
onger 
4 member. At the 


other pole, if he continues to amass 
Credits he attains a threshold permittin g 
deviations from common expectancies 
but with constraints impo, by newly 
differentiated expectancies, 

The apparent paradox. 
both conform to group 


may act to alter them b 


—that leaders 
norms and yet 
Y an exercise of 
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influence—may be explained by refer- 
ence to this sequential process. In this 
regard, it should not be supposed that 
an abundance of credits must lead per- 
force to influence. While an individual 
thus endowed has the potential to dis- 
play more idiosyncratic behavior than 
others, he might not do so, nor would 
he of necessity become a leader thereby. 


Some further points of clarification are 


in order. 


It is easy enough for the individual 
to continue to do habitually that which 
is rewarded by relevant others, so long 
as expectancies remain relatively stable. 
Consider the state of affairs which holds, 
however, in the case of the person who 
has marked status mobility in the group. 
He cannot simply continue to redisplay 
behaviors which were appropriate to the 
group’s earlier expectancies, because the 
expectancies applicable to him 
altered in keeping with his risin 
Other things being equal, this 
two features appropriate to th 
ment of status in an o 
accuracy of social 


are now 
8g status, 
suggests 
e attain- 
pen system: (a) 
perception; and (b) 
ehavior, Insofar as 


is capable of 
them, his status will 


upward. The relati 


ers have a heigh 
tain properties 
in consonance w 


t not the leader has 
4 featiite of the coin: 
he is then familiar, he 
eed perceptual ac- 
phase, as he rose to 
ingly, the finding of 
that leaders are only bel- 
norms where they have 
heir evolution is not in- 
a finding like that of 
Newcomb (6) to the ef- 


judging 
had a part in t 
consistent with 
Chowdhry and 
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fect that leaders have superior social 
perception; it would appear that the 
time phase under study is critical in 
yielding one process or the other. 

Previously the point was made that 
the high-status person could effect 
changes in the common expectancies of 
the group because he has latitude for 
the manifestation of what would be 
seen to be nonconforming behavior for 
others. But. in contrast, the expect- 
ancies regarding the role itself are less 
amenable to alteration by the incum» 
bent. Hence, the leader could readily 
lose credits and find his influence di- 
minished if he were to show idiosyn- 
cratic behavior in terms of expectancies 
associated with his role. Regarding 
such deviation, we may conceive of one 
requirement which is quite likely sig- 
nificant, i.e., perception by the mem- 
bers of the group that the leader’s mo- 
tivation to belong be both high and 
sincere, Should this condition not be 
fulfilled, status may be threatened. To 
take another illustration, innovation by 
the leader may be of high valence to the 
group. It is conceivable that this could 
yield the seeming anomaly of a leader 
who, in the face of this expectancy, 
adopts a passive and ostensibly safe 
course, but loses status. 

Leadership status, therefore, assuredly 
demands conformity to the group’s ex- 
Pectancies regarding the role, but still 
leaves the leader with sway in the 
sphere of common expectancies associ- 
ated with members at large. The 
leader may deviate from these, oF bring 
about their reconstruction, if his print 
Activities have generated an apprepr 
ately high level of credit. 

The motiyational aspects of this prot 
&ss require consideration in tetris of the 
individual set against the background 0 

he group's activity, The restraining 
mie of M on the expenditure of 
Wha accounted for in the foregoing: 

at is more to the point, however * 
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the fact that a person with M; will 
more likely achieve status in a largely 
M, group than will a person with Ma; 
the status achieved, therefore, is a part 
function of the congruence of the indi- 
vidual’s motivation with the generalized 
character of the motivation extant in 
the group. Since motivation is related, 
as well, to performance and other char- 
acteristics represented by alpha value 


and beta value, it is to be regarded as 
a key element. 

Jf the group has a primary focal ac- 
tivity, then Srne M, becomes 
more highly valued, particularly in com- 
bination with alpha value. At the other 
extreme, i.e., where the group activities 
are quite diffuse, M, becomes important 
in combination with beta value. We 
may conceptualize groups of the Ma 
variety as being essentially “socially 
minded.” With a minimum group-cen- 
tered function requiring broad partici- 
pation, the person having beta value is 
more likely to achieve status through 
being well liked; in the M, group, to 
the contrary, alpha value becomes criti- 
cal and task competence has greater 
weight in determining status. The re- 
lationship of this to the current situa- 
tional view of leadership is evident.’ 

Whether group members do distin- 
guish between value of one kind or an- 
othet can be inferred from the literature 
of sociometry. Evidence will be found 
there indicating that individuals can 
give scaled evaluations of thelr peers 
with quite adequate discriminations be- 
tween those they like, those they con- 
sider competent, and so forth. Though 
{hese sociometrically based status ĉon- 
inua ave likely to be related, as we have 
previously noted, they are by no means 
in universal, one-to-one correspondence 


(cf. 10). Viewing status in the aggre: 


9 For example, the finding of Jennings a3) 
that leadership and popularity are highly re- 
lated in her groups can be considered in this 
framework. 
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gative, credit-amassing sense still al- 
lows for the integrity of the roots which 
feed it. Recent experimental evidence 


on the basis for shifts in the group’s 
perception of a member also accords 


with this conception (ef, 14). 

Certain of the assumptions made here 
—e.g., that an individual will have a 
level of credit reposing unitarily in 
others’ perceptions of him, and that he 
may know and make use of the credits 
at his disposal—are only approxima- 
tions of reality. Their literal tenability 
is not crucial to the mechanisms pos- 


tulated, however. One could argue that 
the individual operates as if these as- 


ceptance” which an individu 


al is per- 
Mitted to 3 


In general, then, it 
individual Seeks to 


SUMMARY 


Beginning with the consideratio t 
social behavior depends upon Parin 
of the individual, conditions of the situ- 
ation, and inputs to 4 dytiamic system 
arising from their interaction, a theo. 
retical conception relating conformity 
and status is presented, 
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The major mediating construct intro- 
duced is “Gdiosyncrasy credit,” taken to 
be an index of status, in the operational 
sense of permitting deviations from 
common “expectancies” of the group. 

Credits are postulated to increase ot 
decrease as a function of the group’s 
perception of the individual's task per- 
formance and generalized characteris- 
tics, and of his “idiosyncratic behavior,” 
ie., deviations from its expectancies. 
Though increases in credit are seen to 
permit greater latitude for idiosyncratic 
behavior, motivational and perceptual 
states of the individual, and group-level 
phenomena, are also considered, 
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ERRATUM 


In the article “Uncertainty and Conf 
tween Information-Theory and Behavi 
Berlyne in the November 1957 issue 0 


1957, 64, 329-339), there are two errors, f 
for “> 0,” and on p. 333, Expression 


“= fy” should be substituted 


ict: A Point of Contact Be- 
or-Theory Concepts” by D. E. 
f this Journal (Psychol. Rev., 
On p. 332, in Property 5, 


2 should read “S,2,(log (Sus) — log £)” 
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ACTION DECREMENT AND ITS RELATION TO LEARNING 


EDWARD L. WALKER 


University of Michigan 


It is the purpose of this paper to set 
forth some argument and some evidence 
that what is herein called action decre- 
ment and what has otherwise been la- 
beled reactive inhibition (Ip) or stimu- 
lus satiation (sI) is, at least in some 
Instances, positively related to learning. 
This is in contrast to the usually held 
Position that it is negatively related or 
antithetical to learning. Furthermore, 
it will be argued that when one tracks 
the course of the action decrement ex- 
Perimentally, one also, in fact, tracks 
the course of the perseverative consoli- 
dation which is frequently assumed to 
follow a period of training and is as- 
sumed to be necessary for subsequent 
performance. 

The use of the term action decrement 
rather than any of the other available 
terms stems from the following beliefs. 
Reactive inhibition and stimulus satia- 
tion are no longer considered suitable 
labels because of the growing belief that 
any psychological action or event is fol- 
wes by a temporary restriction Upon 
ft rearousal. It no longer seems cer- 
ae that there is an exclusive rela- 
ik ship between action decrement an! 
fen: responses or to stimulus events, al- 

ough it is admittedly difficult to deal 
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with psychological events which are not 
stimulus-aroused. In short, the posi- 
tion taken is that the action decrement 
is a central event and the relevant sym- 


bol would thus be (I Js 


The contrasting points of view with 
respect to the role of action decrements 
can be elaborated on both a common- 
sense and on a theoretical level, On a 
common-sense level it is reasonable to 
view fatigue and fatigue-like states as 
biologically useful in preventing the or- 
ganism from exceeding its resources and 
thus suffering damage. Such protection 
could be regarded as a sufficient func- 
tion. Also on the common-sense level, 
we all know, or think we know, that a 
tired organism learns very inefficiently. 
Thus common sense tells us that Hull's 
(5, 6, 7) projection of these antitheti- 
cal and detrimental effects to the theo- 
retical level is eminently reasonable. 

However, common sense sometimes 
provides contradictory propositions. It 
is possible to say that learning is not 
likely to occur without practice. Much 
practice is associated with much fatigue. 
The more one practices, the more one 
learns. Therefore, the more fatigue, the 
more learning. On a formal logical ba- 
sis the conclusion is not certain, for 
there is a logical fallacy in the sequence 
of premises. Therefore, empirical dem- 
onstrations are required to establish the 
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truth of the conclusion. Several will be 
cited subsequently in this paper. Fur- 
thermore, the protective character of 
the fatigue-like state is retained, al- 
though the object of protection is a 
perseverative process rather than the 
whole organism. 

The concept of perseverative consoli- 
dation in learning is an intriguing one. 
The concept in its most general form is 
quite simply stated. When an organ- 
ism practices an action, there continues 
for a time a process which produces 
consolidation of the effects of the prac- 
tice. If this consolidation process is in- 
terrupted or interfered with, by such 
events as a traumatic blow on the head 
or by practice on other similar mate- 
rial, as in the retroactive-inhibition 
paradigm, subsequent performance of 
the act will be deficient as a result of 
the interference, It js hypothesized 
that the action decrement is a direct 
result of the process which produces the 
perseverative consolidation, The bio- 


Sharpless and Jasper (12 


tinued or repeated stimulati 
make several distinctions, Among them 
there is a clear difference between “ha. 
bituation” and “effector fatigue.” For 
them, 


on. They 


Habituation, like learning, is clearly refer- 
able to some form of plasticity in the cen- 
tral nervous system... . It does not de- 
pend on effector fatigue, since a response 
which has become habituated to a specific 
stimulus may still be elicited by an appro- 
priate novel stimulus, whereas a fatigued 
effector will fail to respond, not only to the 
repeated stimulus, but to all other stimuli 
of the same relative intensity. 


“Action decrement” is used in this pa- 
per to mean something very similar to 
“habituation” in Sharpless and Jasper. 
It is something like effector fatigue, 
which will accumulate in sufficient 


quantities to eventually modify the 
character of the res 


some undesirable form, Action decre- ` 


ered in this paper. 


The theory can be restated in a more 
concise, although not as yet in a very 


elegant form. Any Psychological action 


and Subsequent per- 
tion decrement per- 
time and then dissi- 
nY circumstances the 


strength. 

this time is limited 
e. An attempt will 
positive, empirical 
ry, but no attempt 


i 'he argument at 
in the following sens 
be made to Present 
evidence for the theo 


RE 
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will be made to account for every phe- 
nomenon in every situation in which 
concepts such as reactive inhibition and 
conditioned inhibition have been effec- 
tively and powerfully applied as nega- 
tive factors, or factors antithetical to 
positive learning. 
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What follows is a verbal model which 
has the potentiality of becoming, un- 
der intelligent modification, a rigorously 
deductive model, a neurophysiological 
model, a mathematical model, or all 
three at once. It is not clearly any of 
these in its present form. 

Perhaps the best introduction is to 
State the problems inherent in what has 
already been said. It is fairly common 
Practice to think of perseverative con- 
Solidation in terms of reverberatory cir- 
Cults or systems. Reverberatory sys- 
tems have two basic deficiencies, from 
the present point of view. There are 
Stave suspicions that reverberatory ac- 
tivity with the initial stimulus as the 
sole impetus could not last as long as 
the consolidation process seems to last. 
Furthermore, one would expect such 
Perseveration to present a condition in 
which it is easier to rearouse the action, 
rather than more difficult. That is, if 
some fraction of an action system con- 
tinues in a state of reverberation, the 
threshold for that action should be low- 
ered, It has been hypothesized here 
that the consolidation process is indexed 
by a decrement, a raised threshold. 

Neurophysiological literature contains 
many empirical examples of the after- 
effect of stimulation in systems more 
Complex than a single fiber. One ex- 
ample which seems to offer promise may 
be seen in Fig. 1. 

This figure is redrawn from The Neu- 
*Ophysiological Basis of Mind (1) by 
Te Eccles. The data in Eccles are 

en from a study reported by Eccles 
and Rall in 1951 (2). A monosynaptic 
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Fic. 1. Changes in sensitivity as a function 
of time after stimulation in a monosynaptic 
reflex system. (Redrawn on a log scale from 
Eccles and Rall, 1, 2.) The numbers 1, 18, 
90, and 450 refer to the number of condition- 
ing pulses delivered to nerve to the Flexor 
Longus Digitorum at a rate of 300 per second. 
Ordinate values represent height of test spike 
relative to first spike from conditioning 


stimulus. 


reflex system is prepared in such a man- 
ner that the nerve trunk afferent to the 
reflex can be stimulated, and recordings 
of potentials can be taken on the effer- 
ent side of the synapse. The spike 
height of the first impulse is taken as 
the baseline (100%). The effects of 
any given number of conditioning pulses 
can then be traced by sending in a test 
pulse after a controlled delay. The 
height of the test spike is then ex- 
pressed as a proportion of the first 
spike produced by the conditioning 
pulse. In the curve labeled “1” it can 
be seen that the second pulse was less 
than the first, if the test pulse was de- 
livered any time during the first three 
seconds after the conditioning pulse. 
Thus there was an output decrement 
traceable for three seconds or more. 
If the conditioning stimulus involved 18 
pulses, in this case delivered at a rate 
of about 300 per second, no decre- 
ment is visible in the graph; but there 
is a period of postsynaptic potentiation 
lasting for at least eight seconds. In 
the curves for 90 and 450 condition- 
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ing pulses, very brief decrements are 
plotted, followed by increments of 150% 
or more. 

There are many reasons why it is 
dangerous to make much use of these 
data: these are motor neurones at 
the spinal cord level, the conditioning 
stimuli are not physiological stimuli, 
they amount to punishment of the fiber 
in a physoiological sense, etc. These 
data are offered only to support the rea- 
sonableness of the ‘following proposi- 
tions. 

1. It is possible that the activation of 
neural tissue results in a decrement in 
output and sensitivity followed by an 
increment. 

2. It is possible that the more in- 
tense the activity, the quicker the re- 
, covery, the greater the rate of recovery, 
and the greater the increment or po- 

tentiation. 

These descriptive statements—of suc- 
cessive states of the neural tissue in- 
volved in a particular action and of the 
results of variation in intensity—then 
form the basis of a model, 


choice, and it seems likely 


the units so i are fired several 


times in the process, Tt is possible to 
conceive that these units go into a dec- 
remental phase together, and in a brief 
period, possibly a second or two, re- 
cover more or less together, and then 
rise into a potentiated or supersensitive 
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phase together. This picture would fall 
somewhere between 1 and 18 on the 
Eccles and Rall graph. 

At this point let us invoke the Magoun 
Ascending Reticular System and its 
nonspecific cortical input. It is pos- 
sible to conceive of either a rain of 
nonspecific impulses or of rhythmic 
waves of arousal activity at EEG fre- 
quencies as bringing about a refiring or 
recycling of the system after it rises 
into its supersensitive phase. It is ob- 
vious that such a mechanism could in- 
volve only a fraction of the original ac- 
tion system, for the animal does not re- 
peatedly turn left at intervals from a 


fraction of a second to one or two 
seconds. 
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which the choice response is assumed to 
have occurred at the left. The fraction 
of the system goes into a decremental 
phase, and recovers into a potentiated 
phase. When it reaches an appropriate 
level, nonspecific input originating in 
some part of the reticular system can 
become effective in rearousing a frac- 
tion of the original action system, which 
then goes into a second decremental 
phase. This iterative process might 
well recur many times. 

_A further step involves the effects of 
different levels of motivation. Suppose 
one were to assume, as does Hebb (4), 
that differences in motivational level 
Correspond in some way to differences 
in nonspecific output from the reticular 
system. According to the descriptive 
statement relative to the effects of in- 
tensity drawn from Eccles and Rall, 
one would expect an increase in the 
level of nonspecific input from an in- 
Crease in motivational Jevel and thus an 
increase in the intensity of the original 
action or any subsequent cycle of it. 
This would produce a quicker recovery 
in absolute time, a steeper curve of re- 
Covery, and a greater potential incre- 
ment. Figure 2B illustrates the effect 
of a higher level of motivation on the 
shape of the cycle and the rate of itera- 
tion. 

A final step involves an assignment 
of a role for reward within the model. 
Consideration leads to the conjecture 
that any reward condition might pro- 
duce an increase in the level of non- 
Specific input, very much as if the level 
_ motivation had been raised to @ 
‘a value. This condition is pic- 
al theoretically, in Fig. 2C, where 
el is assumed to have been intro- 
a ced after the third cycle. This effect 
ane ot would fit the results of brain 
is latio studies as reported by Olds 
be OA (11), and as summarize 
ae s (10). An implanted electrode 

its tip in various parts of the brain 
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can produce approximately the same ef- 
fect on behavior as does water reward 
to thirsty animals. Both are concep- 
tualized here as producing increases in 
the level on nonspecific input. 

There are at least two factors which 
might cause this recycling process to 
cease. The integrity of the action sys- 
tem is dependent upon the condition 
that all units are originally fired at ap- 
proximately the same time. At each 
recycling, a certain number of units 
could be expected to be lost because 
they did not recover in time to partici- 
pate in the next recycling. Thus the 
size of the action system would grow 
progressively smaller through a process 
of fractionation. Furthermore, such a 
system has to some degree an afferent 
and an efferent end. One might con- 
ceive of the recycling process as being 
progressively restricted to the more ef- 
ferent end. Both factors could be ex- 
pected to produce a progressive decrease 
in the correlation between the excit- 
ability cycles of the units of the action 
system. The system would be con- 
ceived to have lost its dynamic in- 
tegrity, and no further consolidation 
would occur from the moment when the 
correlation between the units reaches 
When a unit drops out of the 


zero. 
system, it can be expected to recover 
rather quickly to a semipermanent state 


of slightly lowered threshold, and when 
all perseverative activity has stopped, the 
combined threshold changes will consti- 
tute an increment in habit strength. 
Certain characteristics of the model 
should probably be noted at this point. 
It describes a hypothetical pattern of 
behavior for a population of neural 
units which involve time units of the 
order of seconds. It may eventually 
be possible to derive this model from 
one which describes the behavior of in- 
dividual units in time units of the or- 
der of milliseconds. Second, it will be 
noted that the burden of perseveration 
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Theoretical Course of Action System in an Individual Organism 
in Terms of Probability 
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Fic. 3. Calculated probabilities of repetition for a nonrei 
and from a reinforcement condition (from t 


is in large part carried by the states of 
the units involved, where the state is a 


survive cool- 
ing in a hibernating animal. 


Cooling 
might slow or even stop the recycling 
process i 


bring it back into a Perseverative con- 
dition. Finall , the model at this point 
is largely deterministic, 
most molar level į 
abilistic model, 
Probabilities become factors in the 
model in several ways. As an action 
system rises into a potentiated Phase, 
the precise point of recycling js subject 
to more or less random Variation, F ig- 
ure 2 might well have been drawn with 
different degrees of potentiation before 
recycling. Had this been done, there 
would have been an irregularity in the 


inforcement condition (from Fig. 2A), 
he latter Part of Fig, 2C) 


cycle. Much mo: 
is the problem of the state of the sys- 
tem at the mom 
makes a second choice, 


If the system 
is in decrement, 


then the organism 
should choose the Previously unchosen 
alternative, If jt is in an incremental 
or potentiated phase, the choice should 
be to repeat the original choice. Since 
the time intervals involved in a single 
cycle in Fig, 2 should be in the general 
Tange of a second, and since experi- 
mental situations employed so far do 
not permit time Control with any pre- 
cision at this time Tange, the state of 
the system on the occasion of a second 
choice should be essentially a random 
sample of the available conditions. The 
prediction of what an animal will do on 
the occasion of a second choice can only 
be expressed in terms of probabilities. 


——— 
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The probability of repetition should be 
quite close to the ratio of time in a 
decremental state to time in a potenti- 
ated state. In Fig. 2A this ratio is 
150:50, yielding a probability ($) of 
repetition of .25. In Fig. 2B and in 
the latter part of Fig. 2C, the ratio is 
163:37, yielding a p value of 185. 
Thus the initial p values plotted in 
Fig. 3 are derived directly from Fig. 2. 

In Fig. 3 the assumption is made 
that as long as any part of the action 
system is in an active state, as por- 
trayed in Fig. 2, the probability of 
repetition will remain at a fixed value. 
When the active phase comes to an end, 
this p value will rise to a level slightly 
higher than it was before the activation 
of the system occurred originally. Ob- 
viously, repeated activation will lead 
eventually to a p value of nearly 1.00. 

It is interesting to note that Fig. 3 
conforms to the hypothesis previously 
stated by Walker (14) that the greater 
the decrement, the quicker the recovery 
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and the greater the eventual increment 
(learning). In Fig. 1 the quick re- 
covery and the greater increment are 
associated with the smaller apparent 
decrement. Thus one of the three cor- 
related propositions is inverted in the 
process of transition from the determin- 
istic level to the probabilistic one. 

Tt is tentatively assumed that the 
amount of the decrement, the time re- 
quired for recovery from the active 
phase, and the height of the increment 
will all be proportionally related. What- 
ever influences one will have a propor- 
tional effect on the other two. This 
should not only be true of experimen- 
tally manipulated variables such as level 
of motivation and amount of reward, 
but should describe individual differ- 
ences between organisms as well. That 
is, if, under the same experimental con- 
ditions, two organisms show different 
probabilities of repetition during the 
decremental phase, the one with the 
lower probability of repeating should 
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Fic. 5. Empirically determined course of action aftereffects, t 


aken from Walker (13). Figure 


4 is an approximate smooth fit to these data, 


show quicker recovery and a greater 
increment of habit strength from a sin- 
gle trial. Since there is no immediately 
apparent way to assess the probability 
of repetition in a single organism, it is 


picts this expectation. 


EMPIRICAL EVIDENCE 


The Duration of the Action Decrement 

and the Effect of Reward 

The duration anq Course of the action 
decrement have been traced in an al- 
teration study in a T maze (14). There 
were two groups of thirsty animals, 
One received water in both goal boxes; 
one group received no water. Each ani- 
mal was given a single Pair of choices 
on a given day, with the time between 
choices carefully controlled, Twenty- 
one different time Intervals, ranging 


from about 80 seconds between trials to 
something over four hours, were in- 
vestigated; there were 21 animals per 
point. The model, as specified, pre- 
dicted two things. 

First, it had previously been thought 
that action decrement, at least under its 
other names, would dissipate as an ex- 
ponential function of time. The model 


acquired in 


noted that there is something a little 


A greater tend- 
e rewarded ani- 
mals would mean that if two thirsty 
animals turn left on the first trial, and 
if one is rewarded with water and the 
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other is not, the rewarded animal should 
have a greater tendency to turn the 
other way on the subsequent trial. But 
on the basis of acquired habit strength, 
alone, it should have a greater tendency 
to repeat its first choice at all times. 
The results may be seen in Fig. 5. 

The expectation of a sustained decre- 
ment seems to be borne out by these re- 
sults. There is a greater tendency to 
alternate (or lesser tendency to repeat) 
in the rewarded animals in the early 
stages. The quicker recovery for the 
rewarded group is suggested, but no 
more, and the habit strength from a 
single reward could not be demonstrated 
with the time and resources available. 
The theoretical curves in Fig. 4 are 
actually a smooth fit to the data in 
Fig. 5. The general form of the two 
curves and the relation between them 
were established prior to the experi- 
ment, but there was no way to suggest 
appropriate repetition percentages Or 
time values. 

These results seem to provide partial 
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support for the theory and are certainly 
somewhat difficult to explain from a 
more traditional point of view. 


Motivation and Alteration 


The effort to deal with motivation has 
not been quite as successful as the ef- 
fort to deal with reward. One previ- 
ously unpublished study has been car- 
ried out using a shock grid throughout 
a T maze, with the goal boxes shock- 
free. A high level of shock (40 volts) 
was applied to the animal (his feet had 
been dipped in saline solution) through- 
out two trials. It was hoped to show a 
greater tendency to alternate than in 
the thirst and reward study, quicker 
recovery, and a greater increment in 
habit strength. There are only four. 
points plotted in Fig. 6. The decre- 
ment is greater than in the previous 
study, but the recovery rate seems 
faster, since both of the middle points 
are out of the population of points in 
the thirst-reward study. The final point, 
at 24 hours, does not constitute a sig- 
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Fic. 6. ‘The tendency to repeat choices under stror 
ith the theoretical curve for strong thirst (see Fig. 4). 


lays between trials, compared W. 


ong shock motivation with four different de- 
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nificant demonstration of habit strength 
from a single trial. 


Amount of Decrement and Ease of 
Learning 


A very different kind of study emerges 
from a restatement of the theory in the 
following way. Any factor capable of 
producing a big decrement should be 
capable of producing a big increment 
or rapid learning. 

The background for this experiment 
by Walker and Paradise (17) begins 
with a study by Glanzer (3), and a 
nearly identical one by Montgomery 
(9). Both reasoned that when an ani- 
mal alternated in a T maze, one could 
not tell whether it was now turning 
right because it had previously turned 
left, or whether it now went to the 

white side because it had tried the 
black side the last time. Both built 
+ shaped mazes that could be used as 
T mazes from either top or bottom by 
closing off one starting alley. If an ani- 
mal is given a trial from one end and 
turns left to the black, and if his second 
trial is from the other starting alley, he 
can now alternate responses and turn 
right to the black, or repeat his left 
turn and alternate stimuli to go to the 
white, but he cannot alternate with re- 
Spect to both stimulus and response. 


Both Glanser and Montgomery found 


all three— 


Ponse, or any one 
against the other two, It was found 


that stimulus was most important, and, 
by implication, was Producing the great- 
est action decrement; place was next 
most important; and response was last. 

Following that study, it was reasoned 


that the response could be made a sig- 
nificant factor in alternation if the feed- 
back from the two responses was quali- 
tatively very different. To accomplish 
this, a slightly modified + maze, called 
the side-winder or the contortion maze, 
was built. The animal was required to 
twist in three dimensions to enter the 
goal alley, and the responses to the two 
sides required two very different move- 
ment patterns. The previous design 
Was now repeated by Walker, Dember, 
Earl, Fawl, and Karoly (16), and it 
was found that the response was now a 
considerable factor in alternation. 
Since three factors could be combined 
or separated in the original (horizontal 
maze, H), and since there were four 
estimates of the differential capacity to 
Produce alternation in the side-winder 
(vertical maze, V), there were available 
eight conditions which differed in their 
Capacities to produce alternation, By 


ted in the relative 


Cues in learning, 

ing conditions, fo 
set up. In onec 
the reward Temained with the same 
stimulus, the same Place, and the same 
response, all at the same time (SPR-H 
In another, the reward 
mulus, but was random 


nimals in each group, 
are shown in Fig. 7. The top figure 
tage of repetition (the 
reverse of alternation) found to each of 
the eight factors in the previous studies. 
Thus the letters SPR-V indicate that, 
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Fic. 7. Empirical test of proposition that 
the amount of decrement is positively related 
to rate of learning. Data taken from Walker 


and Paradise (16). See text for explanation 


of conditions. 


when the animal was permitted to al- 
ternate with respect to all three in the 
vertical maze, there was very little tend- 
ency to repeat a choice but a great 
tendency to alternate, and thus a very 
large action decrement. On the other 
hand, the letters P-V mean that in the 
Vertical maze, when the animal was 0t- 
fered an opportunity to alternate stimu- 
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lus and response by repeating place, 
there was a strong tendency to repeat 
place, and, by inference, little action 
decrement. 

The middle figure shows the number 
of days required to reach a criterion of 
learning in each of eight conditions. 
There are eight trials per day under 
thirst with water reward, and the cri- 
terion is 15 correct choices in 16 trials. 
The lower figure indicates the mean 
number of errors in each group. The 
fit is reasonably close and, to the ex- 
tent that it is close, supports the theory. 


Action Decrement, Stimulus Trace, and 
Perseverative Consolidation 


It was asserted earlier that the proc- 
ess being tracked when one measures 
action decrement is perseverative con- 
solidation. It also seems essentially 
true that a stimulus trace and a mem- 
ory trace involve essentially the same 
mechanism. It was for this reason that 
the basic hypothesis of this paper was 
cast in terms of “any psychological ac- 
tion.” This process consists of a pro- 
gressive loss of members of the original 


population of neural units involved in 


the action. 
According to the present hypothesis, 


the occurrence of the event in question, 
whether a presentation of a conditioned 
stimulus or a learning trial, serves to 
correlate the sensitivity cycles of a 
population of neural units. Driven by 
nonspecific input, a part of the system 
thus created goes through a neurally 
active, iterative, recycling process. At 
each cycle some of the original popula- 
tion of neural units is lost. The rate 
of loss will determine the duration of 
the active phase and the form of any 
measure of the effectiveness of the trace 
at any given point in time. 

The form of such a trace is certainly 
a “simple decay function,” as suggested 
by Hull (5) in 1943, and in certain in- 
stances is well fitted by a power func- 
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Trace Conditioning 
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Fic. 8. Two measures of rate of learning of 
a trace-conditioned avoidance response with 
four different CS-US intervals. Data taken 
from Kamin (8), and drawn on a log abscissa, 


tion, as indicated by Hull (6) in 1950, 
Kamin (8) has recently reported data 
on trace conditioning in dogs. He used 
CS-US intervals of 5, 10, 20, and 40 


function of the duration of the trace, 
Certain of his measures of rate of learn- 


é 40-second group 
failed to learn to criterion, 


recycling of the 
ceived of as contribution to the habit 
strength acquired on that tria] in pro- 
portion to the number of the original 
neural units remaining in the system in 
that cycle. If one shoulq interrupt the 
recycling process at any given time 
after the learning trial, the amount of 
habit strength laid down by that trial 
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should be describable as a linear func- 
tion of log time between the trial and 
the interruption. 

Unfortunately, it is extremely difficult 
to demonstrate the quantity of habit 
strength acquired on a single trial, and 
therefore even more difficult to demon- 
strate that a portion of that amount 
was lost through a given interruption. 
However, training can be massed so 
that one trial follows another in rapid 
succession and a large number of trials 
forcibly reactivate the action system 
without appreciable decay between trials. 
Under these conditions, a sufficient quan- 
tity of potential habit strength can be 
built up so that one would expect dem- 


in perseverative 


of training and 
If the memory trace 
he stimulus trace in 
(Fig. 8), then the 


them a simple Stayness-discrimination 
problem for three days. They were 
motivated by shock applied if they 
stayed in the Starting box more than 5 
seconds or jf they failed to choose a 
window after More than 30 seconds. 
There is no Statement of how many 
shocks were required or through how 
much of the training period shocks 
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Fic. 9. A comparison of 
with empirical material on the e 
aken from Thompson and Dean (12), 


were continued. Finally, Thompson 
and Dean gave their animals massed 
training on an easy horizontal-vertical 
Problem. Each animal was trained on 
this problem in the shortest possible pe- 
riod of time; an average of about 28 
minutes seems a likely estimate from 
their description. One group of such 
animals was given electroconvulsive 
shocks 10 seconds after the end of this 
training, another group was shocked 
two minutes after training, a third 
group one hour after training, a fourth 
group four hours after training, and a 
Control group was not subjected to 
shocks. Forty-eight hours later they 
were retrained, and the relative savings 

in relearning were computed. 
rae results may be seen in Fig. 9- 
diet dotted line is the theoretical pre- 
the ion. The solid one is a replot of 
aad error results reported by Thompson 
: Dean, now placed on a log scale. 
vate closeness is probably ad- 
Cernu From the theory, one would 
meee the duration of the action decre- 
to vary widely with the relative 


a theoretical prediction of th 
fiects of ECS on retention of 
and replotted here on a 
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log abscissa. 


ease or difficulty of the problems pre- 
sented, with the type and level of mo- 
tive, with the magnitude and delay of 
reward, and with many other factors. 
The Walker study (14), on which the 
theoretical curve is predicated, and the 
study by Thompson and Dean (13) 
differed on all of these dimensions. 
Either the period of decrement and 
perseverative consolidation is much less 
subject to variation through the ma- 
nipulation of these variables, or the 
closeness of fit in Fig. 9 is a most 
happy pair of combinations of variables. 

In summary, it can be argued that 
there is some merit to the position that 
the action decrement represents a trace 
of the perseverative consolidation proc- 
ess, that it is thus positively related to 
Jearning, at least in some instances, and 
that one can expect operations which 
produce rapid learning to produce great 
decrements. Wherever it is applicable, 
this theory is in complete contradiction 
to the position that the fatigue-like dec- 
rement is detrimental to or antithetical 
to habit strength and learning. 
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NONCONSTRUCTURAL ASPECTS OF PSYCHOLOGICAL 
CONSTRUCTS 


W. W. MEISSNER 


Saint Louis University 


Much of what has been written on 
the subject of psychological theory con- 
struction, both on the part of psycho- 
logical theorists and on the part of the 
philosophers of science, has been di- 
rected to the analysis and determination 
of the nature and function of the con- 
cepts which are found in such theore- 
tical systems. The emphasis seems to 
have centered on the refinement of the 
methodology by which such concepts are 
formed, and on the meaning status of 
the constructs as they function in the 
system. Thus, operationism and the 
constructural method have received ex- 
tensive treatment. They provide the 
method and the meaning for a scientific 
psychology. 

Careful examination of the current 
psychological systems will reveal certain 
conceptual elements which function in 
the formulation and application of these 
theories, but which do not fall under the 
aegis of the formally defined and ac- 
cepted methods by which the systems 
have been constructed. This paper will 
attempt to indicate some of these non- 
constructural aspects, and briefly dis- 
cuss one or two of the implications that 
flow from them. 

Obviously, there are nonsystematic, 
nonscientific, nonconstructural opera- 
tions and processes that enter into the 
construction of a theoretical system by 
way of necessary preliminaries. The 
scientist must read instruments, take 
measurements, feed rats, and a thousand 
other things, which are not directly in- 
corporated into his theory, but which 
provide the data from which he will 
ultimately construct his system. We 
are not concerned with these pretheo- 


retical elements. But we are concerned 
with other nonconstructural aspects 
which reveal themselves in the very con- 
struction and application of the theo- 
retical constructs themselves, and may 
even alter the meaning of the theoretical 
concepts. 

Our investigation will focus on three 


such nonconstructural elements: the at- 


tribution of real existence (as a real 
part of the organism) to theoretical 
constructs, the theoretical assumption of 
the construct as a quasi-real struc- 
ture for purely methodological purposes 
(without asserting its real existence in 
the organism), and lastly, the experi- 
ential reinterpretation of theoretical con- 
cepts which have been previously de- 
fined formally by systematic methods. 

Since the appearance of Bridgman’s 
introduction of the concept of the opera- 
tional definition into the methodology 
of psychology, psychological theorists 
have been deeply concerned over the 
adequacy of operationally defined con- 
structs to function as descriptions of 
psychological phenomena (12). Bridg- 
man’s explicitation of the operational 
method focused attention on the char- 
acter of the constructions which were 
related to any given complex of opera- 
tions. Tolman (32, 33) identified and 
used the intervening variable as a type 
of construct mediating operationally de- 
fined variables. Although he seems to 
have intended the intervening variable 
as a purely functional and relational 
concept, it is not clear that this purely 
relational concept did not turn out to be 
something quite different (31). 

The effort of MacCorquodale and 
Meehl (19) to clarify the terminology 
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applied to these constructions was a 
start in the direction of identifying the 
kinds of concepts which could function 
in a psychological theory. They dis- 
tinguished the intervening variable from 
the hypothetical construct. The inter- 
vening variable was an abstract, em- 
pirical relation, which merely served 
to organize lower level empirical laws. 
The hypothetical construct, however, in- 
volved some supposition of entities or 
processes, which were not directly ob- 
served, and therefore were not included 
in the original definitions. Essentially, 
this “surplus meaning” was taken to be 
the implication of the existence of en- 
tities and the occurrence of events in 
the real subject of investigation. How- 
ever, it seems that this interpretation of 
hypothetical constructs and their “sur- 
plus meaning” is rather limited and has 
led to a certain amount of unnecessary 
confusion, 

Supposing for the moment that “sur- 
plus meaning” has reference prim: 


be taken as meaning 
as formally un- 
+. 300d; really exists ag Part of the 
intrinsic, objective, honinferred, real 
structure, which constitutes the object 
of investigation, Or, the “real exist- 


zey (18) has expressed 
Sort of clarification in h 
description of what he termed “conven- 
tional constructs,” This type of con- 
Struct involves an inference about the 


reality of the organism no less that any 
hypothetical construct—but it is pre- 
cisely an inference about the organism 
as it is known in the theory. It is not 
an inference about the real structure of 
the organism itself, The former sense, 
if found in the formulation and inter- 
pretation of psychological theories, con- 
stitutes a gross misunderstanding of the 
nature of theoretical constructions, as 
well as a misapplication of methodology. 
The reasons for this will be discussed 
below. The latter sense, however, is a 
legitimate Systematic concept, and is the 
only acceptable and Possible meaning of 
“Teal existence” as applied to hypotheti- 
cal constructs. Since this sort of “sur- 
Plus meaning” is not found in the con- 
structed definition of the concept, how- 
ever, it is a nonconstructural element 


within the framework of its proper 


existence” is 


The “surplus 
“real existence” of 
cts must be reserved 
Teal existence in the latter 
retical correlate described 


meaning” involving 
hypothetical constru 
to theoretical- 
sense of a the 
above. 


The failure to recognize this distinc- 


tion between meanings of real existence 
has led to a reconfusion of terminology 
(25). The Supposition of theoretical- 
real existence has sometimes been as- 
similated to the intervening variable 
(22, 23), with the result that the in- 
tervening Variable is no longer merely 
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intervening, and the hypothetical con- 
struct is reduced to the level of a pre- 
liminary and imperfect stage in the de- 
velopment of a fully operational theory 
based on the intervening-variable type 
of construct alone. If Feigl’s “existen- 
tial assumption” can be interpreted in 
this sense of theoretical-real existence, it 
would seem that this plea for the return 
of the hypothetical construct is funda- 
mentally an appeal to this sort of dis- 
tinction in the meaning of “real exist- 
ence” (7, 8, 9). 

Correlative to the issue of real exist- 
ence is the question of the conceptual 
status of constructs and constructural 
models as they function in theory. The 
issue of the legitimacy or illegitimacy of 
the flat assertion that theoretical entities 
have real existence will be decided in 
terms of our understanding of the na- 
ture of these concepts. There is no 
question of the legitimacy of construc- 
tural concepts, but only of the assertion 
of their existence. Generally speaking, 
methodologists are aware that a theo- 
retical model can not be understood as 
representative of the real structure of 
some real entity (28, 31). This is fairly 


‘obvious in the case of the intervening 


variable, whose characteristics are of a 
purely formal nature. But in the case 
of the hypothetical construct, where 
some sort of objectivized (not objective) 
structure is implied, the issue is raised 
again and again. 

A correct appraisal of the nature of 
constructural methodology has led many 
theorists to evaluate the legitimate “sur- 
plus meaning” (i.e., theoretical-real ex- 
istence) of the hypothetical construct as 
an extrasystematic and, therefore, non- 
constructural element, which introduces 
an impure fraction into the constructural 
purity of the theory. However, the sur- 
plus enters into the understanding of 
such a construct only to the extent that 
the construct itself is structural (15). 
The hypothetical construct differs from 


an intervening variable precisely in this, 
that it hypothesizes some sort of struc- 
ture to explain the relation of lower- 
level variables (25). Obviously, a sci- 
entist can not think of structure, as op- 
posed to a merely formal or mathe- 
matical function, without thinking of it 
in terms of “theoretical-real existence.” 
This does not imply anything about the 
real structure of any real thing, but it 
does imply that the theorist is thinking 
of this construct as an analogue to the 
real structure of a real thing. 

Theoretical models must be thought 
of then as research tools, which the 
theorist forms without direct dependence 
on or inference from the immediate data 
of experience. The constructural-theo- 
retical model is related to the ultimate 
real structure of the thing under in- 
vestigation through a complex set of 
relations, which involve certain speci- 
fied methodological procedures, selective 
processes, and measurement techniques. 
The point here is that theoretical con- 
structs are not directly tied down or de- 
termined by the real structure of theit 
objects. If they were so tied to the 
structure of the thing, there would be 
little room for the multiple hypothetical 
models we find in psychology, nor could 
the theorist enjoy the relative freedom 
in structuring the model that is now his. 
Tt is for this reason that the assertion of 
the noninferred real existence of such 
theoretical entities must be looked upon 
as illegitimate. 

Models should not really be thought 
of as primarily imitative, but rather as 
a sort of adaptable pattern, which can 
be compared with the available data of 
experiment to find out what adaptations 
are to be made and in what direction 
the course of our investigation must be 
turned (21). Although it is not clear 
to me what the criteria of usefulness are 
to be in the development of a psycho- 
logical theory, the recent emphasis on 
“usefulness” of theoretical constructs, 
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at least for the formulation of research 
hypotheses, marks an encouraging shift 
from the former preoccupation with 
reality (30). Even on logical grounds, 
it has been pointed out (20) that the 
common procedure of testing theories by 
inference and confirmation is open to the 
possible fallacy that the consequence 
is unique for that theory. If Conse- 
quence B is deduced from Theory A and 
experimentally confirmed, we have added 
to the probability of Theory A; but we 
cannot assume that the same Conse- 
quence B may not be implied by some 
alternative theory. The hypothetical 
construct is a surrogate or symbol which 
is constructed in the theorist’s mind as 
a substitute for the real structure of the 
organism, and which is only indirectly 
related to the structure of that organ- 
ism. The whole direction or impulse of 
theory construction is to the progressive 
formulation of better theoretical con- 
structions which are capable of more 
rigorous evaluation and which will grad- 
ually converge on the real structure of 
the organism (24). 
Feigl (6) has expressed the belief that 
higher-order hypothetical constructs, 
which are built on and constructed from 
operations, will gain real insight into the 
natures of things. But in a later writ- 
ing (8), he recognizes that the meaning 
of a psychological concept is generally 
not identical with its defining operations. 
If we are able to speak of insight into 
the natures of things (whatever that 
may mean in the context of construc- 
tural theory), whatever meaning is con- 
tained in higher-level constructs must be 
reflected in the lower-level definitions in 
terms of observable data. If meaning 
and definition are not identical, we must 
explain the difference. I do not mean 
that we must explain the fact of a dif- 
ference, but we must define the differ- 
ence. Otherwise our procedure is un- 
scientific and methodologically unsound. 
Along this same line, Krech (17) has 


pointed out that hypothetical constructs 
could not function in a purely psycho- 
logical psychology, since the purely psy- 
chological data are beyond the reach of 
scientific inquiry. Because the experi- 
ence of a private datum is an unveri- 
fiable event, and because science is con- 
cerned only with verifiable events, some 
would claim that a scientific psychology 
of private experience is a contradiction 
(32). Either psychology proceeds be- 
havioristically and is a science, or it 
treats of the private datum and relin- 
quishes its claim to scientific status (4). 
Thus Krech has indicated that the con- 
cepts of tensions, needs, and cognitive 
structures are sometimes defined as hy- 
pothetical constructs, and sometimes as 
conscious experience or behavior (17). 
This means that there has been a shift 
in the formal meaning of these concepts, 
and we are dealing with two concepts 
instead of only one. One is a formally 
defined hypothetical construct, which 
functions as an integral part of a theory. 
As such, it is linked to the observable 
data, from which it is structured, by 
low-level empirical laws and operational 
definitions. The other concept, which 
represents the redefinition of the previ- 
ous hypothetical construct in terms of 
experience, is more difficult to delineate 
because of the fact that it is rarely, if 
ever, given any sort of explicit or formal 
presentation. However, in general, it 
seems to take the form of a subtle re- 
interpretation of a previously defined 
concept in terms of some sort of phe- 
nomenological experience. This sort of 
shift in meaning has been pointed out as 
due to a failure to distinguish between 
the operational meaning of intervening 
variables or hypothetical constructs and 
the intuitive properties ascribed to them 
(14). 

This shift in the interpretation of con- 
cepts must not be confused with the 
possible shift that sometimes occurs in 
the construction of theoretical systems 


Nonco’ 


between intervening variables and hy- 
pothetical constructs. It has sometimes 
happened that a concept is defined and 
constructed as a purely functional inter- 
vening variable, but later in the develop- 
ment of the theory it acquires a denota- 
tive meaning and a certain amount of 
structural connotation, which qualifies 
it as a hypothetical construct (10). 
Both Tolman (26) and Hull (10, 19, 
25, 27) have been accused of permitting 
this subtle reorientation in the mean- 
ing of their concepts. Very likely, 
Bergmann’s dismissal of the distinction 
between intervening variable and hy- 
pothetical construct as a pseudodistinc- 
tion (3) stems from insufficient atten- 
tion to the possibility of this sort of 
subtle shift in the formal meaning of 
concepts. 

As long as the surplus meaning, which 
constitutes the concept as a hypothetical 
construct, involves no more than the as- 
sumption of theoretical-real existence, 
the procedure is by no means illegiti- 
mate, and can be scientifically fruitful. 
It is this sense, I believe, that Feig] has 
spoken of “existential assumptions” (7, 
8). However, if the surplus meaning in- 
volves the implication of real existence 
(involving attribution of the hypotheti- 
cal construct to the organism as part of 
the real structure of the organism), the 
connotation is illegitimate and unscien- 
tific, as we have already pointed out. 
There seems to be a certain amount 0 
evidence to suggest that Hull was not 
always able to avoid this error. In re- 
gard to the sHr, he wrote that “it exists 
as an organization as yet largely un- 
known, hidden within the complex struc- 
ture of the nervous system” (11, P- 
102). Likewise, MacCorquodale an 
Meehl were quick to suggest that Hull's 
early articles on conditioning were writ- 
ten because Hull “considered that the 
hypothetical events represented in his 
diagrams may have actually occurred 
and that the occurrence of the events 
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represents the underlying truth about 
the learning phenomena he dealt with” 
(19, pp. 104-105). 

In our discussion thus far, we have 
seen that this sort of implication of the 
real existence of an inferred entity as a 
part of the real structure of an organism 
is extrasystematic and to that extent, 
nonconstructural. This also constitutes 
an illegitimate inference, which is not only 
not supported by the defining method, 
but is contrary to the very nature of 
the scientific constructural methodology. 
We have also seen that the theoretical 
assumption of the constructed concept 
as correlative, and perhaps suggestive of 


the real structure of the organism, 1S 


again extrasystematic and nonconstruc- 


tural, but in this case the assumption of 
existence (which is equivalently an as- 
sumption of nonexistence) is entirely 
legitimate, and in fact a necessary cor 
relate of the development of theoretical 
concepts. We have further suggested a 
third kind of nonconstructural ingredi- 
ent which seems to function in psycho- 
logical theory: the reinterpretation of 
systematically constructed concepts in 
terms of experiential variables. This last 
nonconstructural aspect deserves further 
consideration. 

Spence (31) has already pointed out 
that the method of field theorists like 
Lewin depends heavily on phenomeno- 
logical introspection for the introduction 
of theoretical concepts. If these con- 
cepts are then defined systematically in 
terms of given observables, they func- 
] parts of a scientifically 
constructed theory. But, if they con- 
tinue to be applied and interpreted in 


something else by way of formal con- 
cepts. It is quite possible that the cor- 
relation of empirical constructs with 
formal terms according to the logic of 
dynamics, as Koch (16) has analyzed 
them in Lewin’s system, presents a dif- 


148 W. W. MEISSNER 


ficulty precisely in so far as the em- 
pirical constructs are functioning at a 
phenomenological level and the formal 
terms are presented at a constructural 
level. 

An example may help to clarify the 
problem. Hull (11) introduces his 
concept of habit strength as a func- 
tional stimulus-response relation between 
stimulus energy, response, nature and 
number of reinforcements, and the re- 
action potential, which is measured in 
terms of the actual response. This in- 
tervening-variable type of concept is 
then conceived as a dynamic relation- 
ship between the afferent and efferent 
neural impulses. This seems to be more 
of a hypothetical construct, but the defi- 
nition of the term is still couched in sys- 
tematic and operational terms, and the 
switch from intervening variable to hy- 
pothetical construct is still acceptable. 
However, at one point Hull tells us that 
“it would appear that the presence of 
internal conditions or neural organiza- 
tions, such as habit structures (sHr) 
and reaction potentials (sEr) are also 
reportable” by introspection (13, p: 
344). He is also explicit in stating his 
conception of receptor-effector connec- 
tions in terms of common sense habits 
(11). If Hull is able to identify these 
constructs by introspective methods, it 
follows that the concept, as operation- 
ally defined and identified, is now being 
reinterpreted and reidentified in terms 
of experiential elements which are not 
included in the systematic formulation 
of these concepts. Phenomenological 
introspection is not restricted to field 
theorists alone, so it seems. 

The introduction of these noncon- 
structural elements into even the most 
sophisticated of behavioral systems 
raises a serious question. Snygg (29) 
has raised the issue of a phenomenologi- 
cal psychology. But, like Tolman (32) 
before him, he draws the line between 
the phenomenological and the objective 


in such a way as to give us a none too 
happy choice. Are we forced to make 
such a choice? The scientific method, 
as it has been transferred from natural 
sciences to psychology, seems to have 
exhausted a good deal of its potentiality 
(5). Further, as a method, it is not 
equipped to deal with purely psycho- 
logical data. As a result, and this is the 
point I have tried to make in this paper, 
psychological theory has resorted to a 
subtle, almost unconscious, translation 
of its systematic concepts in terms of 
psychological experience. This step has 
been necessary, in a sense, in order that 
psychological constructs assume some 
semblance of pertinence and meaning- 
fulness for the human psychological 
situation. And this experiential inter- 
polation is a necessary development in 
all psychological theory, for it is only 
in terms of this sort of interpolation that 
psychological theory can function as sig- 
nificant for the problems of applied hu- 
man psychology, and for the prediction 
and control of behavior, 

Rather than interpret this analysis 
as disjunctive and exclusive—and this 
seems to be the prevailing attitude of 
psychological theorists—it might prove 
promising to take it as conjunctive and 
inclusive. I do not mean that the ex- 
periential and objective aspects are re- 
ducible to common grounds. Nor do I 
mean that one or the other should be 
eliminated. The objective, scientific 
techniques developed by experimental 
psychology are invaluable, and any at- 
tempt at devaluating or reducing them 
would be unrealistic, to say the least. 
Experimental methods are a necessary 
means of introducing control, confirma- 
tion, and validation into theory con- 
struction. But it does not seem that 
these methods are in themselves suf- 
ficient to meet the demands of a scien- 
tific psychology. Our theories are now 
subject to experiential reinterpretation 
which takes on the color of a necessary 


- 
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adjunct to their significance and applica- 
bility. As the situation now stands, this 
experiential interpolation is unhealthy 
from the standpoint of scientific theory 
because it is extrasystematic and has no 
formal place in our theory construction. 
For this reason, the more rigorous theo- 
rists have tended to reject it explicitly, 
ol they cannot avoid it implicitly 

But, it is not entirely clear that phe- 
nomenological or experiential aspects 
need be so excluded. If these elements 
can be made explicit in our theories and 
can be justified in our methodologies, 
the present status of psychology would 
be much healthier than it is. The 
formulation of a distinctively psycho- 
logical methodology, based on the in- 
clusion of both objective and experi- 
ential methods, and giving rise to theo- 
retical construction of a distinctively 
psychological character, is a very tempt- 
ing goal. The reconstruction of such a 
distinctive and comprehensive method- 
ology must be left to the future. But 
at least a first step in its direction must 
be taken by pointing out the revelant 
elements as they function in the theories 
we have at the present time. 


SUMMARY 


Three nonconstructural elements, 
which appear in current psychological 
constructural systems, are singled out 
and briefly discussed: (a) attribution 
to a theoretical construct of “real ex- 
istence,” as if the construct were ac- 
tually a description of the real structure 
of the organism, rather than of the or- 
ganism as it is known in the theory; 
(b) attribution of “theoretical-real ex- 
istence” to a construct, without attribut- 
ing the defined structure to the real 
organism which is merely a methodo- 
logical assumption and implies nothing 
about the real organisms; (c) phe- 
nomenological or experiential reinterpre” 
tation of theoretical constructs. 
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these nonconstructural aspects, the first 
is illegitimate, the second is legitimate 
and scientifically sound, and the last is 
discussed in the light of possible evolu- 
tion of a distinctively psychological 
methodology which would incorporate 
both phenomenological and behavioris- 


tic elements. 
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In this paper we shall set forth the 
elements of a theory of human problem 
solving, together with some evidence for 
its validity drawn from the currently 
accepted facts about the nature of prob- 
lem solving. What questions should a 
theory of problem solving answer? First, 
it should predict the performance of a 
problem solver handling specified tasks. 
Tt should explain how human problem 
solving takes place: what processes are 
used, and what mechanisms perform 
these processes. It should predict the 
incidental phenomena that accompany 
problem solving, and the relation of 
these to the problem-solving process. 
For example, it should account for “set” 
and for the apparent discontinuities that 
are sometimes called “insight.” It should 
show how changes in the attendant con- 
ditions—both changes “inside” the prob- 
lem solver and changes in the task con- 
fronting him—alter problem-solving be- 
havior. It should explain how specific 
and general problem-solving skills are 
learned, and what it is that the problem 
solver “has” when he has learned them. 


Information Processing Systems 


Questions about problem-solving be- 
havior can be answered at various levels 
and in varying degrees of detail. The 
theory to be described here explains 
problem-solving behavior in terms 0 
what we shall call information processes- 
Tf one considers the organism to consist 
of effectors, receptors, and a control syS~ 
tem for joining these, then this theory 
is mostly a theory of the control sys 
tem. It avoids most questions of se? 


sory and motor activities. The theory 


postulates: 

1. A control system consisting of a 
number of memories, which contain 
symbolized information and are inter- 
connected by various ordering relations. 
The theory is not at all concerned with 
the physical structures that allow this 
symbolization, nor with any properties 
of the memories and symbols other than 
those it explicitly states. 

2. A number of primitive information 
processes, which operate on the informa- 
tion in the memories. Each primitive 
process is a perfectly definite operation 
for which known physical mechanisms 
exist. (The mechanisms are not neces- 
sarily known to exist in the human 
brain, however—we are only concerned 
that the processes be described without 
ambiguity.) 

3. A perfectly definite set of rules for 
combining these processes into whole 
programs of processing. From a pro- 
gram it is possible to deduce unequivo- 
cally what externally observable behav- 
jors will be generated. 

At this level of theorizing, an ex- 
planation of an observed behavior of 
the organism is provided by a program 
of primitive information processes that 
generates this behavior. 

A program viewed as a theory of 
behavior is highly specific: it describes 
one organism in a particular class of 
situations. When either the situation 
or the organism is changed, the pro- 
gram must be modified. The program 
can be used as a theory—that is, as a 
predictor of behavior—in two distinct 
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ways. First, it makes many precise 
predictions that can be tested in detail 
regarding the area of behavior it is 
designed to handle. For example, the 
theory considered in this paper predicts 
exactly how much difficulty an organ- 
ism with the specified program will en- 
counter in solving each of a series of 
mathematical problems: which of the 
problems it will solve, how much time 
(up to a proportionality constant) will 
be spent on each, and so on. 

Second, there will be important quali- 
tative similarities among the programs 
that an organism uses in various situa- 
tions, and among the programs used by 
different organisms in a given situation. 
The program that a human subject uses 
to solve mathematical problems will be 
similar in many respects to the program 
he uses to choose a move in chess; the 
program one subject uses for any such 
task will resemble the programs used by 
other subjects Possessing similar train- 
ing and abilities. If there were no such 
similarities, if each subject and each 
task were completely idiosyncratic, there 
could be no theory of human problem 
solving. Moreover, there is some posi- 
tive evidence, as we shall see, that such 
similarities and general characteristics 
of problem-solving processes do exist. 

In this paper we shall limit ourselves 

to this second kind of validation of our 
theory of problem solving. We shall 
predict qualitative characteristics of hu- 
man problem-solving behavior and com- 
pare them with those that have already 
been observed and described. Since all 
of the available data on the psychology 
of human problem solving are of this 
qualitative kind, no more detailed test 
of a program is possible at present. The 
more precise validation must wait upon 
new experimental work.t 


1Several studies of individual and group 
problem-solving behavior with logic problems 
have been carried out by O. K, Moore and 
Scarvia Anderson (5). The problems Moore 


In succeeding sections we shall de- 
scribe an information-processing pro- 
gram for discovering proofs for theo- 
rems in logic. We shall compare its be- 
havior qualitatively with that of human 
problem solvers. In general, the proc- 
esses that compose the program are 
familiar from everyday experience and 
from research on human problem solv- 
ing: searching for possible solutions, 
generating these possibilities out of 
other elements, and evaluating partial 
solutions and cues. From this stand- 
point there is nothing particularly novel 
about the theory, It rests its claims on 
other considerations: 

1. It shows Specifically and in detail 
how the processes that occur in human 
Problem solving can be compounded out 
of elementary information Processes, and 


hence how they can be carried out by 
mechanisms, 


2. It shows that a 
rating such Processes, with appropriate 
organization, can in fact solve problems. 
This aspect of problem solving has been 
thought to be “mysterious” and unex- 
plained because it Was not understood 
how sequences of simple processes could 
account for the successful solution of 
complex problems, The theory dissolves 
the mystery by showing that nothing 
more need be added to the constitution 
of a successful problem solver, 


Relation to Digital Computers 
The ability to s 


cisely, and to inf 
havior they will 


Program incorpo- 


pecify Programs pre- 
er accurately the be- 
Produce, derives from 


and Anderson gave th 


what different from those handled by our 
program, and hence a detailed comparison of 
behavior is not yet possible. We are now en- 
gaged, with Peter Houts, in replicating and 
extending the experiments of Moore and An- 
derson with human subjects and at the same 
time modifying our program to predict the 
human laboratory behavior in detail. 


eir subjects are some- 


—_ a 
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information processes that purports to 
describe some human behavior—is coded 
for acomputer. That is, each primitive 
information process is coded to be a 
separate computer routine, and a “mas- 
ter” routine is written that allows these 
primitive processes to be assembled into 
any system we wish to specify. Once 
this has been done, we can find out 
exactly what behavior the purported 
theory predicts by having the computer 
“simulate” the system. 

We wish to emphasize that we are not 
using the computer as & crude analogy 
to human behavior—we are not com- 
paring computer structures with brains, 
nor electrical relays with synapses. Our 
Position is that the appropriate way to 
describe a piece of problem-solving be- 
havior is in terms of a program: a 
specification of what the organism will 
do under varying environmental circum- 
stances in terms of certain elementary 
information processes it is capable of 
performing. This assertion has nothing 
to do—directly—with computers. Such 
programs could be written (now that 
we have discovered how to do it) if 
computers had never existed.? A pro- 
gram is no more, and no less, an analogy 
to the behavior of an organism than is 
a differential equation to the behavior 
of the electrical circuit it describes. 
Digital computers come into the picture 


_ *We can, in fact, find a number of attempts 
in the psychological literature to explain be- 
havior in terms of programs—or the proto- 
types thereof. One of the most interesting, 
because it comes relatively close to the mod- 
ern conception of a computer program, is 
Adrian de Groot’s analysis of problem solving 
by chess players (2). The theory of de Groot 
is based on the thought-psychology of Selz, 2 
somewhat neglected successor to the Wurzburg 
School. Quite recently, and apparently inde- 
Pendently, we find the same idea applied by 
Jerome S. Bruner and his associates to the 
theory of concept formation (1). Bruner uses 
the term “strategy,” derived from economics 
and game theory, for what we have called 4 
program. 


only because they can, by appropriate 
programming, be induced to execute the 
same sequences of information processes 
that humans execute when they are 
solving problems. Hence, as we shall 
see, these programs describe both hu- 
man and machine problem solving at 
the level of information processes.® 
With this discussion of the relation of 
programs to machines and humans be- 
hind us, we can afford to relax into con- 
venient, and even metaphoric, uses of 
language without much danger of mis- 
understanding. It is often convenient 
to talk about the behavior implied by a 
program as that of an existing physical 
mechanism doing things. This mode of 
expression is legitimate, for if we take 
the trouble to put any particular pro- 
gram in a computer, We have in fact a 
machine that behaves in the way pre- 
scribed by the program. Similarly, for 
concreteness, We will often talk as i 
our theory of problem solving consisted 
of statements about the ability of a 
computer to do certain things. 


Tue Locic THEORIST 


We can now turn to an example of 
the theory. This is a program capable 
of solving problems in a particular do- 
main—capable, specifically, of discover- 
ing proofs for theorems in elementary 
symbolic logic. We shall call this pro- 
gram the Logic Theorist (LT).* We 
assert that the behavior of this pro- 
gram, when the stimulus consists of the 
instruction that it prove a particular 
theorem, can be used to predict the be- 


sFor a fuller discussion of this point see 


). 

4Jn fact, matters are a little more compli- 
cated, for in the body of this paper we will 
consider both the basic program of LT and a 
number of variants on this program. We will 
refer to all of these variants, interchangeably, 
as “LT.” This will not be confusing, since 
the exact content of the program we are con- 
sidering at any particular point will always 
be clear from the context. 
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havior of (certain) humans when they 
are faced with the same problem in 
symbolic logic. 

The program of LT was not fash- 
ioned directly as a theory of human be- 
havior; it was constructed in order to 
get a program that would prove theo- 
rems in logic. To be sure, in construct- 
ing it the authors were guided by a firm 
belief that a practicable program could 
be constructed only if it used many of 
the processes that humans use. The 
fact remains that the program was not 
devised by fitting it directly to human 
data. As a result, there are many de- 
tails of LT that we would not expect to 
correspond to human behavior. For ex- 
ample, no particular care was exercised 
in choosing the primitive information 
processes to correspond, point by point, 
with elementary human processes. All 
that was required in writing the pro- 
gram was that the primitive processes 
constitute a sufficient set and a con- 
venient set for the type of program 
under study. 

Since LT has been described in de- 
tail elsewhere (6, 8), the description 
will not be repeated here. It will also 
be unnecessary to describe in detail the 
system of symbolic logic that is used by 
LT. For those readers who are not fa- 
miliar with symbolic logic, we may re- 
mark that problems in the sentential 
calculus are at about the same level of 
difficulty and have somewhat the same 
“flavor” as problems in high school ge- 
ometry.® 


Design of the Experiments 


First we will describe the overt be- 
havior of LT when it is presented with 
problems in elementary symbolic logic. 


5 LT employs the sentential calculus as set 
forth in Chapters 1 and 2 of A, N. Whitehead 
and Bertrand Russell, Principia Mathematica 
(10)—the “classic” of modern symbolic logic. 
A simple introduction to the system of Prin- 
cipia will be found in (3). 


In order to be concrete, we will refer to 
an experiment conducted on a digital 
computer. We take an ordinary gen- 
eral-purpose digital computer,* and store 
in its memory a program for interpret- 
ing the specifications of LT. Then we 
load the program that specifies LT. 
The reader may think of this program 
as a collection of techniques that LT 
has acquired for discovering proofs. 
These techniques range from the abil- 
ity to read and write expressions in 
symbolic logic to general schemes for 
how a proof might be found. 

Once we have loaded this program 
and pushed the start button, the com- 
puter, to all intents and purposes, és 
LT. It already knows how to do sym- 
bolic logic, in the sense that the basic 
tules of operation of the mathematics 
are already in the program (analogously 
to a human’s knowing that “equals 
added to equals give equals” in ele- 
mentary algebra). 

We are now ready to give LT a task. 
We give it a list of the expressions 
(axioms and Previously proved theo- 
rems) that it may take as “given” for 
the task at hand. These are stored in 
LT’s memory, Finally, we present LT 
with another expression and instruct it 
to discover a proof for this expression. 

From this point, the computer is on 
its own. The Program plus the task 
uniquely determines its behavior. It at- 
tempts to find a Proof—that is, it tries 
various techniques, and if they don’t 


6 The experiments described here were car- 
ried out with the RAND JOHNNIAC com- 
puter. The JOHNNIAC is an automatic dig- 
ital computer of the Princeton type. It has a 
word length of 40 bits, with two instructions 
in each word. Its fast storage consists of 
4,096 words of magnetic cores, and its second- 
ary storage consists of 9,216 words on mag- 
netic drums. Its speed is about 15,000 op- 
erations per second. The programming tech- 
niques used are described more fully in (6)- 
The experiments are reported in more det: 


in (7). 


a 
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work, it tries other techniques. If LT 
finds a legitimate proof, it prints this 
out on a long strip of paper. There is, 
of course, no guarantee that it will find 
a proof; after working for some time, 
the machine will give up—that is, it will 
stop looking for a proof. 

Now the experimenters know exactly 
what is in the memory of LT when it 
starts—indeed, they created the pro- 
gram. ‘This, however, is quite differ- 
ent from saying that the experimenters 
can predict everything LT will do. In 
principle this is possible; but in fact 
the program is so complex that the only 
way to make detailed predictions is to 
employ a human to simulate the pro- 
gram by hand. (A human can do any- 
thing a digital computer can do, al- 
though it may take him considerably 
longer.) 

1. As the initial experiment, we stored 
the axioms of Principia Mathematica, 
together with the program, in the mem- 
ory of LT, and then presented to LT 
the first 52 theorems in Chapter 2 of 
Principia in the sequence in which they 
appear there. LT’s program specified 
that as a theorem was proved it was 
stored in memory and was available, 
along with the axioms, as material for 
the construction of proofs of subsequent 
theorems. With this program and this 
order of presentation of problems, L 
succeeded in proving 38 (73%) of the 
52 theorems. About half of the proofs 
were accomplished in less than a minute 
each; most of the remainder took from 
one to five minutes. A few theorems 
were proved in times ranging from 
minutes to 45 minutes. There was 4 
strong relation between the times an 
the lengths of the proofs—the time in- 
creasing sharply (perhaps exponentially) 
with each additional proof step- 

2. The initial conditions were now re- 
stored by removing from LT’s memory 
the theorems it had proved. 
late: “A new subject was obtaine' 


knew how to solve problems in logic but 
was unfamiliar with the particular prob- 
lems to be used in the experiment.”) 
When one of the later theorems of 
Chapter 2 (Theorem 2.12) was pre- 
sented to LT, it was not able to find 
a proof, although when it had held the 
prior theorems in memory, it had found 
one in about ten seconds. 

3. Next, an experiment was performed 
intermediate between the first two. The 
axioms and Theorem 2.03 were stored in 
memory, but not the other theorems 
prior to Theorem 2.12, and LT was 
again given the task of proving the lat- 
ter. Now, using Theorem 2.03 as one 
of its resources, LT succeeded—in fif- 
teen minutes—where it had failed in the 
second experiment. The proof required 
three steps. In the first experiment, 
with all prior theorems available, the 
proof required only one step. 


Outcome of the Experiments 


From these three series of experiments 
we obtain several important pieces of 
evidence that the program of LT is 
qualitatively like that of a human faced 
with the same task. The first, and 
most important, evidence is that LT 
does in fact succeed in finding proofs 
for a large number of theorems. 

Let us make this point quite clear. 
Since LT can actually discover proofs 
for theorems, its program incorporates a 
sufficient set of elementary processes at- 
ranged in a sufficiently effective stratesy 
to produce this result. Since no other 
program has ever been specified for 
handling successfully these kinds of 
problem-solving tasks, no definite al- 
ternative hypothesis is available. We 
are well aware of the standard argu- 
ment that “similarity of function does 
not imply similarity of process.” How- 
ever useful a caution this may be, it 
should not blind us to the fact that 
specification of a set of mechanisms 
sufficient to produce observed behavior 
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is strong confirmatory evidence for the 
theory embodying these mechanisms, es- 
pecially when it is contrasted with theo- 
ries that cannot establish their suffi- 
ciency. 

The only alternative problem-solving 
mechanisms that have been completely 
specified for these kinds of tasks are 
simple algorithms that carry out ex- 
haustive searches of all possibilities, 
substituting “brute force” for the selec- 
tive search of LT. Even with the speeds 
available to digital computers, the prin- 
cipal algorithm we have devised as an 
alternative to LT would require times 
of the order of hundreds or even thou- 
sands of years to Prove theorems that 
LT proves in a few minutes. LT’s suc- 
cess does not depend on the “brute 
force” use of a computer’s speed, but 
on the use of heuristic Processes like 
those employed by humans.” This can 
be seen directly from examination of the 
Program, but it also shows up repeat- 
edly in all the other behavior exhibited 
by LT. 

The second important fact that 
emerges from the experiments is that 
LT’s success depends in a very sensitive 
way upon the order in which Problems 
are presented to it. When the sequence 
is arranged so that before any Particu- 
lar problem is reached Some potentially 
helpful intermediate Tesults have al- 
ready been obtained, then the task is 
easy. It can be made Progressively 
harder by skipping more and more 
of these intermediate Stepping-stones, 
Moreover, by Providing a single “hint,” 
as in the third experiment (that is, 
“Here is a theorem that might help”), 
we can induce LT to solye a problem 
it had previously found insoluble. All 
of these results are easily reproduced in 
the laboratory with humans, To com- 
pare LT’s behavior with that of a hu- 


7 A quantitative analysis of the power of the 
heuristics incorporated in LT will be found in 


(7). 
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man subject, we would first have to 
train the latter in symbolic logic (this 
is equivalent to reading the program 
into LT), but without using the specific 
theorems of Chapter 2 of Principia 
Mathematica that are to serve as prob- 
lem material. We would then present 
problems to the human subject in the 
same sequence as to LT. For each new 
sequence we would need naive subjects, 
since it is difficult to induce a human 
subject to forget completely theorems 
he has once learned. 


PERFORMANCE PROCESSES IN THE 
Locic THEORIST 


We can learn more about LT’s ap- 
proximation to human problem solving 
by instructing it to print out some of 
its intermediate results—to work its 


methods it applies, successfully and un- 
successfully, i 
We can obtain this infor 
level of detail we wish, and make 
correspondingly detailed study of LT’s 
processes, 
To understand the additional informa- 
tion provided by this “thinking aloud” 
Procedure, we need to describe a little 


rect answer on a p 
we infer from it about other processes 

on inside the subject (or the machine) is, of 
course, another question, In the case of the 
machine, the Problem js simpler than in the 
case of the human, for we can determine ex- 
actly the correspondence between the internal 
Processes and “what the machine ‘printal outs 


+ 


We 


he 


a 


Vi reve of whether p is true, 
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more fully how LT goes about solving 
problems. This description has two 
parts: (a) specifying what constitutes 
a proof in symbolic logic; (b) describ- 
ing the methods that LT uses in find- 
ing proofs. 


Nature oj a Proof 


A proof in symbolic logic (and in 
other branches of logic and mathemat- 
ics) is a sequence of statements such 
that each statement: (a) follows from 
one or more of the others that precede 
it in the sequence, or (b) is an axiom or 
previously proved theorem.° Here “fol- 
lows” means “follows by the rules of 
logic.” 

LT is given four rules of inference: 

Substitution. In a true expression 
(for example, “[p or p] implies p”) 
there may be substituted for any vari- 
able a new variable or expression, pro- 
vided that the substitution is made 
throughout the original expression. 
Thus, by substituting p or q for p in 
the expression “(p or p) implies p,” we 
get: “([p or q] or [p or q]) implies 
(p or q)” but not: “(fp or q] or P) 
implies p.” 

x The axioms of symbolic logic and the theo- 

ries that follow from them are all tautologies, 
true by virtue of the definitions of their terms. 
It is their tautological character that gives 
laws of logic their validity, independent of 
empirical evidence, as rules of inductive 10- 
ference. Hence the very simple axioms that 
we shall use as examples here will have an 
appearance of redundancy, if not triviality. 
For example, the first axiom of Principia 
states, in effect, that “if any particular sen- 
tence (call it p) is true, or if that same sen- 
tence (p) is true, then that sentence (p) _ is 
Je, “if frogs are sh, 
frogs are fish.” The 
utologically true 
for in 


frogs are not fish. 
in problem solving, not in logic, thé 
can regard LT’s task as one of manip 
symbols to produce desired expressions, F 
he can ignore the material interpretations o 


these symbols. 
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Replacement. In a true expression a 
connective (“implies,” etc.) may be re- 
placed by its definition in terms of other 
connectives. Thus “A implies B” is de- 
fined to be “not-A or B”; hence the two 
forms can be used interchangeably. 

Detachment. If “A” is a true expres- 
sion and “A implies B” is a true expres- 
sion, then B may be written down as a 
true expression. 

Syllogism (Chaining). It is possible 
to show by two successive applications 
of detachment that the following is also 
legitimate: If “a implies b” is a true 
expression and “b implies c” is a true 
expression, then “a implies c” is also a 
true expression. 


Proof Methods 


The task of LT is to construct a 
proof sequence deriving a problem ex- 
the axioms and the previ- 


pression from 
y the rules of 


ously proved theorems b 
inference listed above. But the rules 
of inference, like the rules of any mathe- 
matical system or any game, are per- 
missive, not mandatory. That is, they 
state what sequences 14y legitimately 
be constructed, not what particular se- 
quence should be constructed in order 
to achieve a particular result (i.e., to 
prove a particular problem expression). 
The set of “Jegal” sequences is exceed- 
ing large, and to try to find a suitable 
sequence by trial and error alone would 
almost always use UP the available time 
or memory before it would exhaust the 
set of legal sequences.”° 

To discover proofs, LT uses methods 
which are particular combinations of in- 
formation processes that result in co- 
ordinated activity aimed at progress in 
a particular direction. LT has four 
methods (it could have more): substi- 


10See (7). The situation here is like that 
jn chess Or checkers where the player knows 
what moves are legal but has to find in a rea- 
sonable time a move that is also “suitable’— 
that is, conducive to winning the game. 
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tution, detachment, forward chaining, 
and backward chaining. Each method 
focuses on a single possibility for achiev- 
ing a link in a proof. 

The substitution method attempts to 
prove an expression by generating it 
from a known theorem employing sub- 
stitutions of variables and replacements 
of connectives. 

The detachment method tries to work 
backward, utilizing the rule of detach- 
ment to obtain a new expression whose 
proof implies the proof of the desired 
expression. This possibility arises from 
the fact that if B is to be proved, and 
we already know a theorem of the form 
“A implies B,” then proof of A is tan- 
tamount to proof of B. 

Both chaining methods try to work 
backward to new problems, using the 
tule of syllogism, analogously to the 
detachment method. Forward chaining 
uses the fact that if “a implies c” js 
desired and “a implies b” is already 
known, then it is sufficient to prove “b 
implies c.” Backward chaining runs the 
argument the other way: desiring “g 
implies c” and knowing “b implies c” 
yields “a implies b” as a new problem. 

The methods are the major organiza- 
tions of processes in LT, but they are 
not all of it. There is an executive 
process that coordinates the use of the 
methods, and selects the subproblems 
and theorems upon which the methods 
operate. The executive Process also ap- 
plies any learning processes that are to 
be applied. Also, all the methods uti- 
lize common subprocesses jn carrying 
out their activity. The two most im- 
portant subprocesses are the matching 

process, which endeavors to make two 
given subexpressions identical, and the 
similarity test, which determines (on 
the basis of certain computed descrip- 
tions) whether two expressions are 
“similar” in a certain sense (for de- 
tails, cf. 8). 

LT can be instructed to list its at- 


ALLEN NEWELL, J. C. SHAW, AND HERBERT A. SIMON 


attempts, successful and unsuccessful, to 
use these methods, and can list the new 
subproblems generated at each stage by 
these attempts. We can make this con- 
crete by an example: 

Suppose that the problem is to prove 
“p implies p.” The statement “(p or 
P) implies p” is an axiom; and “p im- 
plies (p or p)” is a theorem that has 
already been proved and stored in the 
theorem memory, Following its pro- 
gram, LT first tries to prove “p im- 
plies p” by the substitution method, but 
fails because it can find no similar theo- 
rem in which to make substitutions. 

Next, it tries the detachment method. 
Letting B stand for “p implies p,” sev- 
eral theorems are found of the form “A 
implies B.” For example, by substitu- 
tion of not-p for q, “p implies (q or p)” 
becomes “p implies (not-p or p)”; this 
becomes, in turn, by replacement of 
“or” by “implies”: “p implies (p im- 
plies p).” Discovery of this theorem 
creates a new subproblem: “Prove A” 
—that is, “prove P.” This subproblem, 
of course, leads nowhere, since p is not 
a universally true theorem, hence can- 
not be proved. 

At a later Stage in its search LT tries 
the chaining method. Chaining for- 
ward, it finds the theorem “p implies (p 
or p)” and is then faced with the new 
problem of proving that e 


successful, in its proof—and the ones 
we have omitted from our exposition, as 
well—can be printed out to provide us 
with a complete record of how LT exe- 


cuted its program in solving this par- 
ticular problem. 


Some CHARACTERISTICS OF THE 
PROBLEM-SOLVING Process 


Using aS our data the information 
provided by LT as to the methods it 
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tries, the sequence of these methods, 
and the theorems employed, we can ask 
whether its procedure shows any Te- 
semblance to the human problem-solv- 
ing process as it has been described in 
psychological literature. We find that 
there are, indeed, many such resem- 
blances, which we summarize under the 
following headings: set, insight, con- 
cept formation, and structure of the 
problem-subproblem hierachy. 


Set 

The term “set,” sometimes defined as 
“a readiness to make a specified re- 
sponse to a specified stimulus” (4, P- 
65), covers a variety of psychological 
phenomena. We should not be sur- 
prised to find that more than one as- 
pect of LT’s behavior exhibits “set,” 
nor that these several evidences of set 
correspond to quite different underlying 
processes. 

1. Suppose that after the program 
has been loaded in LT, the axioms an 
a sequence of problem expressions are 
placed in its memory. Before LT un- 
dertakes to prove the first problem ex- 
pression, it goes through the list of 
axioms and computes a description of 
each for subsequent use in the “simi- 
larity” tests. For this reason, the proof 
of the first theorem takes an extra in- 
terval of time amounting, in fact, to 
about twenty seconds. Functionally 
and phenomenologically, this computa- 
tion process and interval represent a 
preparatory set in the sense in whic! 
that term is used in reaction-time €x- 
periments. It turns out in LT that this 
preparatory set saves about one third 
of the computing time that would other- 
wise be required in later stages 0 
program. x 

2. Directional set is also evident 19 
LT’s behavior. When it is attempting 
a particular subproblem, LT tries rst 
to solve it by the substitution method. 


If this proves fruitless, and only the 
Bure 


paip 


dni. 


it tries the detachment method, then 
chaining forward, then chaining back- 
ward. Now when it searches for 
theorems suitable for the substitution 
method, it will not notice theorems that 
might later be suitable for detachment 
(different similarity tests being applied 
in the two cases). It attends single- 
mindedly to possible candidates for sub- 
stitution until the theorem list has been 
exhausted; then it turns to the detach- 
ment method. 

3. Hints and the change in behavior 
they induce have been mentioned ear- 
lier. Variants of LT exist in which the 
order of methods attempted by LT, and 
the choice of units in describing expres- 
sions, depend upon appropriate hints 
from the experimenter. 

4. Effects from directional set occur 


in certain learning situations—as illus- 
y the classical ex- 


trated, for example, b 

periments of Luchins. Although LT at 
the present time has only a few learn- 
ing mechanisms, these will produce 
strong effects of directional set if prob- 
lems are presented to LT in appropriate 
sequences. For example, it required 
about 45 minutes to prove Theorem 
2.48 in the first experiment because 
LT, provided with all the prior theo- 
rems. explored so many blind alleys. 


quite different set of possibilities. 

The instances of set observable in the 
present program of LT are natural and 
unintended by-products of a program 
constructed to solve problems in an effi- 
cient way. In fact, it is difficult to see 
how we could have avoided such effects. 
In its simplest aspect, the problem-solv- 
ing process is a search for a solution in 
a very large space of possible solutions. 
The possible solutions must be examined 
in some particular sequence, and if they 
are, then certain possible solutions will 


be examined before, others. The par- 
Dav, Hese” on 


ue coli 
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ticular rule that induces the order of 
search induces thereby a definite set in 
the ordinary psychological meaning of 
that term. 

Preparatory set also arises from the 
need for processing efficiency. If cer- 
tain information is needed each time a 
possible solution or group of solutions 
is to be examined, it may be useful to 
compute this information, once and for 
all, at the beginning of the problem- 
solving Process, and to store it instead 
of Tecomputing it each time, 

The examples cited show that set can 
arise in almost every aspect of the 


fying Perceptions (the “viewing” set), 

and it can consist in preparatory proc- 

esses (the description of axioms). 
None of the examples of set in LT re- 


certain Psychologica] Phenomena bear 
this out—the set in association experi- 
ments, and so-called “incubation” Proc- 
esses. LT as it now stands is inade- 
quate in this respect. 


Insight 


In the Psychological literature, “in- 
sight” has two Principal connotations: 
(a) “suddenness” of discovery, and (b) 
grasp of the “structure” of the problem, 
as evidenced by absence of trial and 
error. It has often been pointed out 
that there is no necessary connection 
between the absence of Overt trial-and- 
error behavior and grasp of the prob- 
lem structure, for trial and eror may 
be perceptual or ideational, and no ob- 
vious cues may be present in behavior 
to show that it is going on. 

In LT an observer’s assessment of 
how much trial and error there is will 
depend on how much of the record of 


its problem-solving processes the com- 
puter prints out. Moreover, the amount 
of trial and error going on “inside” 
varies within very wide limits, depend- 
ing on small changes in the program. 
The Performance of LT throws some 
light on the Classical debate between 
Proponents of trial-and-error learning 
and proponents of “insight,” and shows 
that this Controversy, as it js usually 
phrased, rests on ambiguity and con- 
fusion. LT searches for solutions to 
the problems that are Presented it. 


_To keep matters simple, let us con- 
sider just one of the methods LT uses 


problem expression to be proved by sub- 
stitution. The crudest tri 


different sequences depending on the 


nature of the latter, Then, if the de- 
pendence is an 


amount of search i 


age can be reduced, A simple strategy 
of this sort would be to try in the 
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axioms only substitutions involving vari- 
ables that actually appear in the prob- 
lem expression. 

The actual generator employed by LT 
is more efficient (and hence more “in- 
sightful” by the usual criteria) than 
this. In fact, it works backward from 
the problem expression, and takes into 
account necessary conditions that a sub- 
stitution must satisfy if it is to work. 
For example, suppose we are substitut- 
ing in the axiom “p implies (q or ps? 
and are seeking to prove “r implies (r 
or r).” Working backward, it is clear 
that if the latter expression can be ob- 
tained from the former by substitution 
at all, then the variable that must be 
substituted for p is r. This can be seen 
by examining the first variable in each 
expression, without considering the rest 
of the expression at all (cf. 7). 

Trial and error is reduced to still 
smaller proportions by the method for 
searching the list of theorems. Only 
those theorems are extracted from the 
list for attempted substitution which 
are “similar” in a defined sense to the 
problem expression. This means, in 
practice, that substitution is attempted 
in only about ten per cent of the theo- 
rems. Thus a trial-and-error search of 
the theorem list to find theorems simi- 
lar to the problem expression is substi- 
tuted for a trial-and-error series of at- 
tempted substitutions in each of the 
theorems. 

In these examples, the concept of 
proceeding in a “meaningful” fashion 
is entirely clear and explicit. Trial- 
and-error attempts take place in some 
“space” of possible solutions. To ap- 
proach a problem “meaningfully” is to 
have a strategy that either permits the 
search to be limited to a smaller sub- 
Space, or generates elements of the spacé 
in an order that makes probable the 
discovery of one of the solutions early 


in the process. 
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We have already listed some of the 
most important elements in the pro- 
gram of LT for reducing search to 
tolerable proportions. These are: (a) 
the description programs to select theo- 
rems that are “likely” candidates for 
substitution attempts; (b) the process 
of working backwards, which uses in- 
formation about the goal to rule out 
large numbers of attempts without 
actually trying them. In addition to 
these, the executive routine may select 
the sequence . of subproblems to be 
worked on in an order that takes up 
“simple” subproblems first. 


Concepts 

Most of the psychological research on 
concepts has focused on the processes 
of their formation. The current ver- 
sion of LT is mainly a performance pro- 
gram, and hence shows no concept for- 
mation. There is in the program, how- 
ever, a clearcut example of the use of 
concepts in problem solving. This is 
the routine for describing theorems and 
searching for theorems “similar” to the 
problem expression or some part of it 
in order to attempt substitutions, de- 
tachments, or chainings. All theorems 
having the same description exemplify 
a common concept. We have, for ex- 
ample, the concept of an expression 
that has a single variable, one argument 
e on its left side, and two argument 


plac 
places on its right side: “p implies (p 
or p)” is an expression exemplifying 


this concept; so is “q implies (q im- 
plies q)-” 

The basis for these concepts is purely 
pragmatic. Two expressions having the 
same description “look alike” in some 
undefined sense; hence, if we are seek- 
ing to prove one of them as a theorem, 
while the other is an axiom or theorem 
already proved, the latter is likely con- 
struction material for the proof of the 


former. 
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Hierarchies of Processes 


Another characteristic of the behav- 
ior of LT that resembles human prob- 
lem-solving behavior is the hierarchical 
structure of its processes. Two kinds 
of hierarchies exist, and these will be 
described in the next two paragraphs. 

In solving a problem, LT breaks it 

down into component problems. First 
of all, it makes three successive at- 
tempts: a proof by substitution, a proof 
by detachment, or a proof by chaining. 
In attempting to prove a theorem by 
any of these methods, it divides its task 
into two parts: first, finding likely raw 
materials in the form of axioms or theo- 
tems previously proved; second, using 
these materials in matching. To find 
theorems similar to the problem expres- 
sion, the first step is to compute a de- 
scription of the problem expression; the 
second step is to search the list of theo- 
rems for expressions with the same de- 
scription. The description-computing 
program divides, in turn, into a pro- 
gram for computing the number of lev- 
els in the expression, a program for 
computing the number of distinct varj- 
ables, and a program for computing the 
number of argument places. 

LT has a second kind of hierarchy 
in the generation of new expressions to 
be proved. Both the detachment and 
chaining methods do not give proofs di- 
rectly but, instead, provide new alter- 
native expressions to prove, LT keeps 
a list of these subproblems, and, since 
they are of the same type as the origi- 
nal problem, it can apply all its prob- 
lem-solving methods to them. These 
methods, of course, yield yet other sub- 
problems, and in this way a large net- 
work of problems is developed during 
the course of proving a given logic ex- 
pression. The importance of this type 
of hierarchy is that it is not fixed 
in advance, but grows in response to 
the problem-solving process itself, and 


shows some of the flexibility and trans- 
ferability that seem to characterize hu- 
man higher mental processes. 

The problem-subproblem hierarchy in 
LT’s program is quite comparable with 
the hierarchies that have been discov- 
ered by students of human problem- 
solving processes, and particularly by 
de Groot in his detailed studies of the 
thought methods of chess players (2, 
pp. 78-83, 105-111). Our earlier dis- 
cussion of insight shows how the pro- 
gram structure permits an efficient com- 
bination of trial-and-error search with 
systematic use of experience and cues 
in the total problem-solving process. 


SUMMARY oF THE EVIDENCE 


We have now reviewed the principal 
evidence that LT solves problems in a 
manner closely resembling that exhibited 
by humans in dealing with the same 
problems. First, and perhaps most im- 
portant, it is in fact capable of finding 
proofs for theorems—hence incorporates 
a system of processes that is sufficient 
for a problem-solving mechanism. Sec- 
ond, its ability to solve a particular 
problem depends on the sequence in 
which problems are presented to it in 
much the same way that a human sub- 
ject’s behavior depends on this sequence. 
Third, its behavior exhibits both pre- 
paratory and directional set. Fourth, 
it exhibits insight both in the sense of 
vicarious trial and error leading to “sud- 
den” problem solution, and in the sense 
of employing heuristics to keep the to- 
tal amount of trial and error within rea- 
sonable bounds. Fifth, it employs sim- 
ple concepts to classify the expressions 
with which it deals, Sixth, its program 
exhibits a complex organized hierarchy 
of problems and subproblems. 


COMPARISON WITH OTHER THEORIES 

We have proposed a theory of the 
higher mental Processes, and have shown 
how LT, which is a particular exemplar 


Try 
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of the theory, provides an explanation 
for the processes used by humans to 
solve problems in symbolic logic. What 
is the relation of this explanation to 
others that have been advanced? 


Associationism 


The broad class of theories usually 
labelled “associationist” share a gener- 
ally behaviorist viewpoint and a com- 
mitment to reducing mental functions 
to elementary, mechanistic neural events. 
We agree with the associationists that 
the higher mental processes can be per- 
formed by mechanisms—indeed, we have 
exhibited a specific set of mechanisms 
capable of performing some of them. 

We have avoided, however, specify- 
ing these mechanisms in neurological 
or pseudo-neurological terms. Problem 
solving—at the information-processing 
level at which we have described it— 
has nothing specifically “neural” about 
it, but can be performed by a wide class 
of mechanisms, including both human 
brains and digital computers. We do 
not believe that this functional equiva- 
lence between brains and computers im- 
plies any structural equivalence at a 
more minute anatomical level (€-8+ 
equivalence of neurons with circuits). 
Discovering what neural mechanisms Te- 
alize these information-processing func- 
tions in the human brain is a task for 
another level of theory construction. 
Our theory is a theory of the informa- 
tion processes involved in pr 
ing, and not a theory of neural or elec- 
tronic mechanisms for information proc- 
essing. 

The picture of the cen 
system to which our theor: 
picture of a more complex 
system than that contemplated by most 
associationists. The notions of “ 
“fixation,” “excitation,” an 
tion” suggest @ relatively P 
trochemical system (or, alte! 
passive switchboard”), acted upor by 


stimuli, altered by that action, and sub- 
sequently behaving in a modified man- 
ner when later stimuli impinge on it. 
In contrast, we postulate an informa- 
tion-processing system with large stor- 
age capacity that holds, among other 
things, complex strategies (programs) 
that may be evoked by stimuli. The 
stimulus determines what strategy Or 
strategies will be evoked; the content 
of these strategies is already largely de- 
termined by the previous experience of 
the system. The ability of the system 
to respond in complex and highly se- 
lective ways to relatively simple stimuli 
is a consequence of this storage of pro- 
grams and this “active” response to 
stimuli. ‘The phenomena of set and in- 
sight that we have already described 
and the hierarchical structure of the re- 
tem are all consequences of 


sponse sys 
this “active” organization of the cen- 


tral processes. 

The historical preference of behavior- 
ists for a theory of the brain that pic- 
tured it as a passive photographic plate 
or switchboard, rather than as an ac- 
tive computer, is no doubt connected 
with the struggle against vitalism. The 
invention of the digital computer has 
acquainted the world with a device— 
obviously a mechanism—whose response 
to stimuli is clearly more complex and 
“active” than the response of more tra- 
ditional switching networks. It has pro- 
vided us with operational and unobjec- 
tionable interpretations of terms like 
“purpose,” “set,” and “insight.” The 
real importance of the digital computer 
for the theory of higher mental proc- 
esses lies not merely in allowing us to 
realize such processes “in the metal” 
and outside the brain, but in providing 
us with a much profounder idea than 
we have hitherto had of the character- 
istics a mechanism must possess if it is 
to carry out complex information-proc- 
essing tasks. 
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Gestalt Theories 


The theory we have presented resem- 
bles the associationist theories largely in 
its acceptance of the premise of mecha- 
nism, and in few other respects. It re- 
sembles much more closely some of the 
Gestalt theories of problem solving, and 
perhaps most closely the theories of “di- 
rected thinking” of Selz and de Groot. 
A brief overview of Selz’s conceptions 
of problem solving, as expounded by 
de Groot, will make its relation to our 

theory clear. 

1. Selz and his followers describe 
problem solving in terms of processes 
or “operations” (2, p. 42). These are 
clearly the counterparts of the basic 
processes in terms of which LT is speci- 
fied. 

2. These operations are organized in 
a strategy, in which the outcome of each 
step determines the next (2, p. 44). 
The strategy is the counterpart of the 
program of LT. 

3. A problem takes the form of a 
“schematic anticipation.” That is, it is 
posed in some such form as: F ind an X 
that stands in the specified relation R 
to the given element E (2, pp. 44—46). 
The counterpart of this in LT is the 
problem: Find a sequence of sentences 
(X) that stands in the relation of proof 
(R) to the given problem expression 
(E). Similarly, the subproblems posed 
by LT can be described in terms of 
schematic anticipations: for example, 
“Find an expression that is ‘similar’ to 
the expression to be Proved.” Many 
other examples can be supplied of 
“schematic anticipations” in LT. 

4, The method that js applied to- 
ward solving the problem is fully speci- 
fied by the schematic anticipation. The 
counterpart in LT is that, upon receipt 
of the problem, the executive program 
for solving logic problems specifies the 
next processing step. Similarly, when 
a subproblem is posed—like “prove the 


theorem by substitution”—the response 
to this subproblem is the initiation of 
a corresponding program (here, the 
method of substitution). 

5. Problem solving is said to involve 
(a) finding means of solution, and (b) 
applying them (2, pp. 47-53). A coun- 
terpart in LT is the division between the 
similarity routines, which find “likely” 
materials for a proof, and the matching 
routines, which try to use these mate- 
rials. In applying means, there are 
needed both ordering processes (to as- 
sign priorities when more than one 
method is available) and control proc- 
esses (to evaluate the application) (2, 
pP. 50). 

6. Long sequences of solution meth- 
ods are coupled together, This coupling 
may be cumulative (the following step 
builds on the result of the preceding) 
or subsidiary (the Previous step was 
unsuccessful, and a new attempt is now 
made) (2, p. 51). In LT the former is 
illustrated by a successful similarity 
Comparison followed by an attempt at 
matching; the latter by the failure of 
the method of Substitution, which is 
then followed by an attempt at detach- 
ment. 

7. In cumulative Coupling, we can dis- 
tinguish Complementary methods from 
subordinated methods (2, p. 52). The 
former are illustrated by successive sub- 
stitutions and replacements in succes- 
sive elements of a pair of logic expres- 
sions. The latter are illustrated by the 
role of matching as a subordinate proc- 
ess in the detachment method. 

We could continue this list a good 
deal further. Our purpose is not to 
suggest that the theory of LT can or 
should be translated into the language 
of “directed thinking.” On the con- 
trary, the specification of the program 
for LT clarifies to a considerable ex- 
tent notions whose meanings are only 
vague in the earlier literature. What 
the list illustrates is that the processes 
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that we observe in LT are basically the 
same as the processes that have been 
observed in human problem solving in 
other contexts. 


PERFORMANCE AND LEARNING 


LT is primarily a performance ma- 
chine. That is to say, it solves prob- 
lems rather than learning how to solve 
problems. However, although LT does 
not learn in all the ways that a human 
problem solver learns, there are a num- 
ber of important learning processes in 
the program of LT. These serve to 
illustrate some, but not all, of the forms 
of human learning. 


Learning in LT 


By learning, we mean any more or 
less lasting change in the response of 
the system to successive presentations 
of the same stimulus. By this defini- 
tion—which is the customary one—LT 
does learn. 

1. When LT has proved a theorem, 
it stores this theorem in its memory. 
Henceforth, the theorem is available as 
material for the proof of subsequent 
theorems. Therefore, whether EL as 
able to prove a particular theorem de- 
pends, in general, on what theorems it 
has previously been asked to prove. 

2. LT remembers, during the course 
of its attempt to prove a theorem, what 
subproblems it has already tried to 
solve. If the same subproblem is ob- 
tained twice in the course of the at- 
tempt at a proof, LT will remember 
and will not try to solve it a secon 
time if it has failed a first. 

3. In one variant, LT remembers 
what theorems have proved useful in 
the past in conjunction with particular 
methods and tries these theorems first 
when applying the method in question. 
Hence, although its total repertory of 
methods remains constant, it learns to 
apply particular methods in particular 
Ways. 
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These are types of learning that 
would certainly be found also in human 
problem solvers. There are other kinds 
of human learning that are not yet rep- 
resented in LT. We have already men- 
tioned one—acquiring new methods for 
attacking problems. Another is modi- 
fying the descriptions used in searches 
for similar theorems, to increase the effi- 
ciency of those searches. The latter 
learning process may also be regarded 
as a process for concept formation. We 
have under way a number of activities 
directed toward incorporating new forms 
of learning into LT, but we will post- 
pone a more detailed discussion of these 
until we can report concrete results. 


What is Learned 


The several kinds of Jearning now 
found in LT begin to cast light on 
the pedagogical problems of “what is 
learned?” including the problems of 
transfer of training. For example, if 
LT simply stored proofs of theorems as 
it found these, it would be able to prove 
a theorem a second time very rapidly, 
but its learning would not transfer at 
all to new theorems. The storage of 
theorems has much broader transfer 
value than the storage of proofs, since, 
as already noted, the proved theorems 
may be used as stepping stones to the 
proofs of new theorems. There is no 
mystery here in the fact that the trans- 
ferability of what is learned is depend- 
ent in a very sensitive way upon the 
form in which it is learned and remem- 
bered. We hope to draw out the impli- 
cations, psychological and pedagogical, 
of this finding in our subsequent re- 
search on learning. 


CoNCLUSION 


We should like, in conclusion, only to 
draw attention to the broader impli- 
cations of this approach to the study 
of jnformation-processing systems. The 
heart of the approach is describing the 
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behavior of a system by a well speci- 
fied program, defined in terms of ele- 
mentary information processes. In this 
approach, a specific program plays the 
role that is played in classical systems 
of applied mathematics by a specific 
system of differential equations. 

Once the program has been specified, 
we proceed exactly as we do with tra- 
ditional mathematical systems. We at- 
tempt to deduce general properties of 
the system from the program (the equa- 
tions); we compare the behavior pre- 
dicted from the program (from the 
equations) with actual behavior ob- 
served in experimental or field settings; 
we modify the program (the equations) 
when modification is required to fit the 
facts. 

The promise of this approach is sey- 
eral-fold. First, the digital computer 
provides us with a device capable of 
realizing programs, and hence, of actu- 
ally determining what behavior is im- 
plied by a program under various en- 
vironmental conditions. Second, a pro- 
gram is a very concrete specification of 
the processes, and permits us to see 
whether the processes we Postulate are 
realizable, and whether they are suffi- 
cient to produce the phenomena. The 
vaguenesses that have plagued the the- 
ory of higher mental processes and other 
parts of psychology disappear when the 
phenomena are described as programs, 

In the present Paper we have illus- 
trated this approach by beginning the 
construction of a thoroughly operational 
theory of human problem solving. There 

is every reason to believe that it will 
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prove equally fruitful in application to 
the theories of learning, of perception, 
and of concept formation. 
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AVOIDANCE CONDITIONING THROUGH IRRADIATION: 


A NOTE ON P 
AND PSYCHOLOGI 
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Recent experiments by Garcia et al. 
(41, 13, 14) appear to be of great in- 
terest to psychologists studying that as- 
pect of learning called avoidance be- 
havior. These studies have shown that 
exposure to Co% acts as an uncondi- 
tioned stimulus (US) to produce avoid- 
ance behavior to the cues associated 
with the presentation of the radiation. 
In a footnote to one of these reports it 
was noted that “the physiological mecha- 
nisms through which radiation is ca- 
pable of acting as an unconditioned 
stimulus are as yet unknown” (13, p- 
40). It is the purpose of this note to 
indicate possible physiological mecha- 
nisms through which this effect occurs 
and to comment upon the importance 
of this phenomenon for psychology- 

Wynne and Solomon (22) have shown 
that interfering with the normal func- 
tioning of the peripheral autonomic 
nervous system (ANS) by surgical and 
pharmacological techniques interferes 
with an animal’s ability to acquire an 
avoidance response which, when learned, 
in some cases undergoes spontaneous 
extinction. Auld (3) reports that tet- 
raethylammonium, & drug which blocks 
impulses through the autonomic gan- 
glia, has a depressive effect upon spee 
of response in an escape-avoidance situ- 
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ation, although the possibility of a con- 
founding effect due to a depressant ac- 
tion of the drug upon motor activity has 
been noted by Brady (4). By using 
a serial discrimination problem and uti- 
lizing such measures as errors, trials to 
a criterion, and trials to extinction, one 
may evaluate the effects of autonomic 
block that are independent of the effects 
of the drug upon activity or motor Ca- 
pacity of the organism. Under these 
conditions it was found that tetraethyl- 


ammonium as well as hexamethonium, 


a second drug which has as one of its 
major effects the block of impulses 
through the autonomic ganglia, inter- 
feres not only with learning, utilizing 
escape from shock as motivation (2), 
put also interferes to some extent with 
learning in less stressful situations (1): 
From these studies one may conclude, 
although tentatively because of a short- 
age of confirmatory evidence, that the 
cues associated with the normal function- 
ing of the ANS are important in goal 
directed, integrated behavior. Whether 
this is true because of an increased drive 
state produced by ANS functioning (for 
a recent summary see 21) or because of 
the feedback to the cortex from the ANS 
(5) is of little import for the considera- 
tions presented in this paper. 

Ina thorough review of the relation- 
ships between irradiation injury and 
host defense mechanisms, Donaldson and 
Marcus (7) have shown that irradia- 
tion-induced stress produces an increase 
jn adrenal cortical function. This con- 
clusion is based upon the work of 
Dougherty and White (8), Patt et al. 
(20), North and Nims (19), and Hoch- 
man and Bloch-Frankenthal (15), who 
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found decreases in ascorbic acid after 
irradiation. Nims and Sutton (18) pre- 
sent data which indicate that the de- 
crease in adrenal cholesterol level as 
well as the polydipsia which accom- 
panies whole body radiation is the re- 
sult of increased activity of the pitui- 
tary-adrenal system. Lasser and Sten- 
strom (16), working with humans fol- 
lowing pelvic area irradiation, concluded 
that the adrenal cortex underwent defi- 
nite changes in the course of irradia- 
tion, although these changes did not 
appear to be correlated with clinical 
“radiation sickness.” 3 
If, then, the feedback associated with 
ANS functioning induced by irradiation 
has the same properties as the autonomic 
feedback associated with electric shock, 
a sudden loud noise, or any number 
of other stimuli, we would hypothesize 
that, just as the pairing of environmen- 
tal cues with electric shock causes these 
cues to acquire fear-inducing properties, 
so the pairing of environmental cues 
with irradiation should have this effect, 
It is these cues, then, which, when later 
presented without electric shock or ir- 
radiation, produce avoidance behavior, 
Gastric changes, general malaise, and 
nausea are also known to accompany 
irradiation (Burchtgott; * Davis et al., 
6; Leary, 17) and may well cause ANS 
feedback as- well as feedback through 
the proprioceptive fibers of the somatic 
nervous system. This feedback also 
may become associated with environ- 
mental cues in such a way that upon 
presentation of the environmental cues 
an avoidance response to these cues is 
manifest.’ 


3 Due to differences in dosage and exposure 
time, these studies are not directly applicable 
to the work of Garcia et al, However, it will 
be assumed tentatively that the time-dose pa- 
rameter used by Garcia et al: results in ANS 
effects similar to those produced in those stud- 
ies concerned more explicitly with irradiation 

changes. 
R communication from E, Furcht- 
gott, University of Tennessee, May, 1957, 


The impact, from the psychologist’s 
point of view, of the studies by Garcia 
et al. is twofold. First, it appears that 
a US may produce avoidance condi- 
tioning even when the subject is not 
aware directly of its action. Awareness 
in this sense refers to the functioning 
of particular peripheral somatic affer- 
ents. When using electric shock to the 
feet of a rat through a grid upon which 
it stands, pain receptors in the feet 
supply awareness of the location of the 
trauma. When the US is radiation, 
particularly at minimal dosages, the ani- 
mal has no awareness of the trauma 
through the somatic neryous system. 
Some knowledge, however, that it is 
undergoing a stress may be mediated 
through the action of the ANS, the 
changes in gastric functioning, and the 
general malaise and nausea produced by 
irradiation. 

Second, the use of radiation as a US 
provides a solution to the problem noted 
by Solomon and Brush (21) of the 
general lack in experimental psychology 
of aversive stimuli other than electric 
shock. In this area the implications of 
the studies by Garcia et al. for further 
research are vast indeed. In a T maze 
will rats learn to turn away from a goal 
box in which they have been irradiated? 
What is the function of the curve relat- 
ing drive strength and dosage level? 
What are the temporal parameters as- 
sociated with the use of radiation as a 
US? Willa rat learn an instrumental 
response, such as bar Pressing, to turn 
off a radiation source?® A review of 


5In a recent Study, Garcia and Kimeldorf 
(10) also reach the conc! 


logical mechanism unde: 
tioning through irradiat: 
of the ANS, although 
what different from 
paper. 

ë Since writing this paper, it has come to 
the author’s attention that a program of re- 
search at the U, S. Naval Radiological De- 
fense Laboratory has been undertaken with 
just such questions in mind (10, 12). 


that presented in this 


we 
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the behavioral effects of ionizing radia- 
tion may be found in Furchtgott (9). 
The relative importance of radiation as 
an aversive stimulus as compared to, let 
us say, bright light or intense sound, 
appears to be in the greater poten- 
tialities it offers for elucidating the role 


of 


the underlying physiological mecha- 


nisms in learning and the problem of 
learning without “awareness.” 
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THE PROBLEM OF REDUCTIONISM IN PSYCHOLOGY 


RICHARD JESSOR1 


University 


The recent upsurge of interest in 
physiological determinants and physical 
models of behavior has raised or re- 
opened certain fundamental questions 
about the logical status of psychology as 
an autonomous discipline among the sci- 
ences. Some of the discussions may 
fairly be summarized as implying that 
physiological or physical concepts are in 
some sense more basic than those of psy- 
chology and that, therefore, causal ex- 
planation of behavior will utimately be 
expressed in those terms. The contro- 
versy over the nature of the hypothetical 
construct (HC) in psychological ex- 
planation is a current example. Krech 
(16) has insisted that HCs must have 
neurophysiological reference or locus, 
and that psychologists should be content 
to define such things as needs, tensions, 
or cognitive structures as neural events, 
While this particular view has been criti- 
cized (2, 15), and there is much agree- 
ment that neurophysiological content js 
irrelevant in HCs (20), the more general 
orientation—that fundamental explana- 
tion will be reductive—continues to be 
influential. 

Psychologists are not alone in being 
concerned about this problem, Anthro- 
pologists such as Kroeber and White 
have felt called upon to defend the au- 
tonomy of their discipline against those 
who have tried to explain culture in 
terms of the concepts of psychology or 

even biology. (White caustically com- 
ments that some anthropologists “have 
sold their culturological birthright for 


1 The author was a Social Science Research 
Council postdoctoral fellow at the University 
of California during the time in which this 
paper was prepared. 


of Colorado 


a mess of psychiatric pottage” [25, p. 
xix]). In turn, prominent biologists 
such as Needham, Haldane, Woodger, 
and Bertalanffy have argued that their 
own discipline cannot be considered sim- 
ply an application of chemistry and 
physics. 

Obviously, an inherent attraction of 
reductive explanation is its implications 
for possible ways of unifying the sepa- 
rate scientific disciplines. Since the 
unity of science is an ultimate aim of 
many scientific workers, the reductive 
point of view is not likely to be aban- 
doned in the absence of reasonable al- 
ternative approaches to that goal. A 
commitment to reductionism, either be- 
cause of its implications for the unity of 
science, or because of the belief that it 
represents more fundamental explana- 
tion, undoubtedly influences the strategy 
of work of many scientists.? For this 
reason alone it would seem worth while 
to assess the doctrine in some detail. 

Some considerations of the reduction- 
ism problem have tended to dismiss it. 
One basis for dismissal is the assertion 
that the answer to the problem is en- 
tirely empirical in nature, depending on 
the course of future developments of 
science about which speculation is ad- 
mittedly dangerous, This view seems 
unwarranted; while the ultimate rela- 
tions among the sciences will be an em- 
pirical outcome, at any given point in 
time, the relations among the sciences 


2? Sound empirical 


work, of course, requires 
no defense, whether 


10 de motivated by reductionis- 
tic aims or not, Nothing said in this paper 
should be interpreted as depreciating the value 
of empirical or theoretical efforts to bridge the 
gap between disciplines. 
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are legitimate and important questions 
for logical analysis. Another basis for 
dismissal, especially where the disci- 
plines involved are psychology and 
physiology, is the adoption of the reduc- 
tionistic view as a logical or an in-prin- 
ciple certainty. The soundness of this 
view, also, is open to question, as will be 
shown. The purpose of this paper, then, 
is to examine the problem of reduction- 
ism and to try to make a small begin- 
ning in separating issues which are logi- 
cal in character from those which are 
empirical. In the course of our analy- 
sis, it will be contended that there are 
certain logical barriers to any present- 
day physiological reductionism. 


Tue DOCTRINE OF REDUCTIONISM 


The essence of reductionism would 
seem to include four related general 
propositions. (2) The several disciplines 
or sciences may be considered as hier- 
archically ordered from, ¢.8-, physics at 
the base through chemistry, biology, and 
psychology, to the social and historical 
disciplines at the top.* (b) The second 
essential aspect of reductionism is the 
proposition that the terms or concepts 
and the relations or laws of one dis- 
cipline may fully and without loss of 
meaning be translated into or deduced 
from those of another discipline. (c) 
Such deduction or derivability proceeds 
only in one direction, from lower to 


of course, easily 


3 Finer discriminations can, 5 
-known “border 


be made by including the well t 
disciplines such as biochemistry Or social psy- 
chology. But the fact that there is no sharp 
break between the sciences, and that it is fre- 
quently difficult to tell where one leaves © 
and the other begins, need not, in itself, ch 

lenge the autonomous existence of the sever 
disciplines, What may be implied by ordering 
them in a hierarchy from lower to higher will 


be discussed shortly; for the moment it is only 


important to consider that this is one of the 
Notions essential to the doctrine of reduc- 


tionism. 


higher * levels in the hierarchical order- 
ing, and hence the term “reductionism”; 
terms and Jaws of the higher discipline 
are “reduced” to those of a lower one. 
Thus, in our earlier example, the psy- 
chological term “cognitive structure” is 
considered translatable into—deducible 
from—terms belonging to neurophysi- 
ology. (d) The final aspect is the im- 
plicit or explicit proposition that the 
lower the level of terms employed to 
explain a given phenomenon, the more 
causal or fundamental or basic the ex- 
planation. This is really only a corol- 
lary of the first point if certain assump- 
tions about the nature of the hierarchical 
ordering are made. These four proposi- 
tions together would seem to constitute 
the essential meaning of reductionism as 
a general doctrine. ‘An adherent of that 
point of view may not, of course, sub- 
scribe to all of its aspects. 

The primary focus of this paper is 
upon the issues attending the reduction 
of psychology to physiology, and our 
evaluation of the doctrine will for the 
most part be oriented toward that spe- 
cific context. Within that context, the 
following comments are illustrative of 
the position which supports the doc- 
trine. “Logically and in principle, 
physiological reduction is a certainty. 


4 Terms in this paper referring to “position” 
in the hierarchy of the sciences, €g. higher- 
lower, upward-downward, top-bottom, are by 
no means to imply any valuative judgment. 
The meaning of position in the hierarchy has 
been variously specified, for example, as re- 
ferring to levels of abstraction or levels of 
integration, or as referring to the order of his- 
torical evolution within the universe of the sub- 
ject matter of the sciences, or even to the order 
of historical emergence of the sciences them- 
selves (25). As pointed out later, the concept 
of levels of science is not an analytically clear 
one (cf. 17). For present purposes it is suf- 
ficient to take note of the existence in scientific 
discourse of such a hierarchical concept, and to 
recognize the traditional general ordering which 
places the physical sciences at the base, the 
al sciences in the middle, and the social 


biologic: 
sciences at the top of the hierarchy. 
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Every bit of behavior and everything 
that can, like conscious contents, be 
defined in terms of behavior has its 
physiological correlate” (2, p. 442); 
“Relative to the ‘molar’ (or macro-) 
‘account given by behavioristic psychol- 
ogy, the nheurophysiological account is 
a micro-description of the very same 
events and processes” (10, p. 623); and 
finally, “. . . molar behavioristics is in 
theory completely reducible to under- 
lying neurophysiological principles. . . . 
A completely deterministic neurophysi- 
ology must of necessity permit deriva- 
tion of all molar behavioral laws” (20, 
PP. 261-262). An obvious corollary of 
these statements is the logical reduci- 
bility of physiological principles in turn 
to those of chemistry and ultimately of 
physics. 

Any challenge to these kinds of state- 
ments and their implications can be seen 
to require both logical and empirical 
arguments. We shall try to show where 
each kind is appropriate, 

To begin with, to speak of reducing 

one discipline to another Tequires that 
the terms or concepts of the one be dis- 
tinguishable from those of the other. 
This is not an easy requirement, Wood- 
ger (26) calls it an interesting method- 
ological question to inquire how one 
knows what belongs to the language of 
neurology and what belongs to that of 
psychology. Similarly, in commenting 
on the new terms which wil] accrue as 
science develops, Hempel ( 12) notes 
that it is by no means certain that each 
of these terms will be readily classifiable 
as physical or nonphysical, What seems 
to be a necessary Preliminary for our 
examination of physiological reduction- 
ism is some general criterion for separat- 
ing or identifying the terms of psychol- 
ogy and physiology. If this is accom- 
plished by defining psychology in a cer- 
tain way, it should make apparent some 
logical barriers to the Possibility of a 
physiological reduction. 


A FUNCTIONAL DEFINITION OF 
PSYCHOLOGY 


Admittedly, the definition of any sci- 
entific discipline is somewhat arbitrary; 
despite this, it is certainly possible to 
obtain adequate agreement on criteria 
for segregating one discipline from an- 
other or for grossly circumscribing the 
domain of a particular science. Among 
psychologists there is considerable agree- 
ment that the scope and subject matter 
of concern is the behavior of whole, hu- 
man? organisms. The diffculty with 
such a general statement is that the 
term “behavior” is not without am- 
biguity; psychologists have been no- 
toriously neglectful in providing a sys- 
tematic definition of a response. This 
laxness has, it is felt, obscured the con- 
ceptual boundary between psychology 
and Physiology. The present discussion 
of behavior as a Psychological concept 
follows the implicit orientation of all 
functional behavior theories, and more 
specifically, the approaches of Kantor 
(14), Brunswik (5, 6, 7), and Tolman 
(24). 

The central point of these approaches 
is that behavior, qua psychological, re- 
fers to an organism-environment inter- 
action or relationship, Tolman (24) 
specifically states that the complete 
ny behavior-act re- 


essentially the same point. In consid- 
ering the question of where to establish 
the boundaries of a behavior, Bentley 
(1) similarly concludes that behavior 
must be recognized as a transdermal 
process or event whose description must 
immediately ang functionally include the 
environmental and situational settings. 

5 The writer assumes that the study of ani- 


al behavior by Psychologists is merely pro- 
Paedeutic to a science of human behavior. 


REDUCTIONISM 


Within this framework, then, behavior 
viewed psychologically is interactional 
or relational in nature; its specification 
or identification at the referential level 
requires the specification of a particular 
context and a set of relationships thereto. 
Our definition of psychology, therefore, 
excludes the study of organisms or physi- 
cal environments per se, and behavior 
may not be referred to either alone. 
The laws of behavior of a discipline so 
defined refer to the dynamics of organ- 
jsm-environment functional interaction. 
The terms or concepts of those laws de- 
scribe what may be called an interaction 
or behavior space. 

This definition of psychology was un- 
dertaken as a means of providing a 
criterion for deciding whether a behavior 
term, or a law involving such a term, 
properly belongs to psychology or physi- 
ology. Our criterion requires of psy- 
chological terms that they have immedi- 
ate reference to a functionally defined 
environment or context. Before draw- 
ing the implications of these considera- 
tions for reductionism, a further com- 
ment on this kind of definition may be 
in order. Some objection may be raised 
to the relational or transdermal char- 
acter of the definition in that there is 
provided no palpable locus for a psy- 
chologically defined behavior. Those 
who raise such a query seem to be oper- 
ating within what Woodger (27) pic- 
turesquely describes as a “finger an 
thumb” philosophy of metaphysics, 1-¢+5 
the notion that a thing is real or exists 
only if it can in principle be picked up 


between the finger and thumb. Inter- 
actions or relations, though not simple 
s real 


Physical objects, are nevertheles: 
and concrete and can be precisely speci- 
fied by the conditions and course of their 


Occurrence. 


INCOMPLETE DERIVABILITY oF TERMS 


ow be asserted that the refer- 


It may n e 
defined, is pro- 


ence of psychology, as 
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foundly different from that of physiol- 
ogy. The terms and laws of the latter 
refer to intraorganismic or intradermal 
processes, or, at most, relations between 
an organism or its parts and the space 
defined by physics. They refer, in short, 
to the functioning of anatomical struc- 
tures or processes described by body 
parameters. To state one of the major 
contentions of this paper, the absence in 
contemporary physiology of any sys- 
tematic terms for describing the func- 
tional environment or context of be- 
havior would seem to preclude, on logi- 
cal grounds alone, any complete reduc- 
tion of psychology to physiology. These 
necessary and sufficient conditions for 
the terms of psychology cannot be de- 
scribed in physiological (or physical) 
terms alone. This “incompleteness” of 
the lower discipline’s language, in being 
able to specify only the physiological 
correlates but not the environmental cor- 
relates of a behavior or response, con- 
stitutes the logical impediment. This 
point obviously requires elaboration and 
further support. 

Let us take as an example an occur- 
rence described in common-sense, non- 
systematic language—the wave of an 
arm as two persons pass each other— 
and compare the systematic descriptions 
of it by the two disciplines being dis- 
cussed. The systematic language of the 
physiologist enables him to rely on only 
body parameters or physical terms. He 
may thus speak of arm-displacement, 
changes in muscle-tension, metabolic 
rate, blood-volume distribution, and 
neural reactivity. In none of these 
terms is it systematically possible to 
take cognizance of the social context. 
On the other hand, the psychologist 
may describe the event as waving a 
greeting to a friend, or even, since 
micromediation is generally of little in- 
terest to him, and equifinality generally 
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taken for granted, simply as greeting 
a friend. If psychological laws refer to 
interactions between organisms and func- 
tionally defined environments, only these 
latter descriptions of events can logically 
lead to the achievement of such laws. 
A discipline such as physiology, lacking 
such contextual terms, cannot therefore 
be considered logically equivalent; hence 
its adequacy cannot be guaranteed, even 
in principle, as a complete reduction 
base for psychology. 

One of the strongest empirical sources 
of support for some of these considera- 
tions may be found in the research 
analyses of Campbell (8, 9) and Smeds- 
lund (22, 23) evoked by the recent con- 
troversy over “what is learned.” As 
mentioned earlier, the term “response” 
has been conceptually neglected; this 
neglect seems, in turn, to be related to 
the inadequate attention given to the 
environment or context of behavior by 
many psychological theorists (cf. 13 in 
this connection). Brunswik (5, 6, 7), 
however, is one of those who has called 
attention to the problem in his emphasis 
upon distal achievement in the adjust- 
ment of an organism to its ecology. 
Beginning with this orientation, and ap- 
praising a variety of learning experi- 
ments, especially those dealing with 
transposition, both Campbell and Smeds- 
lund conclude that the learned response 
must be defined in environmental terms 
in order successfully to accommodate— 
predict—the actual research findings. 
Campbell states that “. . . the learned 
response is to be essentially defined in 
terms of a shift in the organism-environ- 
ment relationship rather than a motor 
response defined in terms of organism 
or body parameters alone” (9, p. 105). 

Scrutiny of the literature on reduc- 
tionism has showed only meager if any 

attention to the specific issue raised in 
6 That is, the same psychological event may 


rved by (partly constituted of) an almost 
ete variety of different physiological events. 


this paper. Two writers may be cited 
whose remarks, made in other contexts, 
are pertinent to the logical soundness 
of our contention. In his Tarner lec- 
tures (26), Woodger stresses the role of 
environment as a determinant at vari- 
ous biological levels from zygote to 
whole human organism. He rejects the 
body-mind dualism in favor of speaking 
about persons and developing a person- 
language. The notion of person re- 
quires, for its very definition, environ- 
mental specification, and the latter, he 
observes, requires words belonging to 
sociology. To treat persons otherwise, 
i.e., in terms of body parameters, loses 
sight of this fact: “But this is the only 
way in which we can treat them so long 
as we confine ourselves to the physical 
sciences, since these sciences do not pro- 
vide a vocabulary for speaking about 
them in any other way” (26, p. 261). 
Hempel has discussed the problem in 
relation to the possible derivability of 
all the laws of empirical science from 
those of physics (a logical corollary of 
Physiological reductionism), Affirming 
that not all the terms of empirical sci- 
ence are definable by means of the vo- 
cabulary of Physics, he asserts that 
“22a law Containing, say, certain bio- 
logical terms cannot, in general, be logi- 
cally derived from a set of physical 
laws, which contain no biological terms 
at all...” (12, pp. 320-321).7 Both of 
these writers seem in support of our 
point of view that there are logical bar- 
riers to any thorough-going physiologi- 
cal reductionism. The barriers reside 
in the absence of terms in the “lower” 
discipline which would enable the logical 
derivability of descriptions of the func- 
tional context of behavior and, thereby, 
the derivation of the laws of psychology. 


* To achieve such a derivation requires some 
law connecting the biological concepts with 
the physical concepts. “But those connecting 
laws are not Purely physical in character” (12, 


p. 321). And they have the character of 
empirical laws, 
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At least two kinds of questions can be 
raised about our analysis and are worth 
consideration at this point. The first 
of these is the possibility of overcoming 
the logical problem we have raised sim- 
ply by incorporating into physiology the 
category of terms we have suggested 
that it lacks. This is frequently what is 
implied by the phrase “in principle” in 
assertions about the possibility of re- 
ductionism. But as Sellars (21) has 
pointed out in another connection, this 
makes the entire problem an empty 
truism in that it involves a “tacit re- 
definition” of physiological theory to en- 
compass psychology. Such a redefini- 
tion is a statement about the future state 
of the sciences involved, and it there- 
fore transfers the discussion from logical 
to empirical grounds. That is, the ques- 
tion of whether a future physiology will 
be able to encompass psychology de- 
pends entirely upon the nature and di- 
rection of ongoing empirical development 
of both disciplines. It is to be noted 
here, too, that the meaning of reduction 
is always and only relative to a given 
state of the disciplines concerned. To 
suggest adding terms to physiology in 
order to make psychology deducible 
from it implies the “elevation” or “ex- 
pansion” of physiology just as much as 
it implies reduction of psychology. (In 
connection with these issues, see the 
Meehl-Sellars discussion of the logic 
of emergentism, 19.) Finally, it seems 
unlikely that scientific theory develops 
or advances by simple accretion of the 
terms of other theories. We will return 
to this point shortly, in considering re- 
ductionism and the unity of science. | 

The second kind of question which 
may be raised is implicit in the earlier 
quote from Bergmann. He bases his af- 
firmation of the logical certainty O 
physiological reductionism on the propo 
sition that “. . . everything that can be 
defined in terms of behavior has its 
physiological correlate” (2, P- 442). 


The essential point of this position would 
be that the environmental reference of 
behavior upon which we based our defi- 
nition must, if effective, be represented 
within the organism in its physiology or, 
especially, neurophysiology. This is, 
in a sense, a proximal approach to be- 
havior rather than a distal one. Certain 
arguments may be brought to bear 
against this position, such as the ab- 
scence of strict one-to-one proximal- 
distal correlation—e.g., in perception— 
and the theoretical significance of vicari- 
ous functioning or equifinality. The 
major reply which may be made, how- 
ever, is that the “recovery” of actual 
behavioral phenomena from physiologi- 
cal correlates requires the conceptual 
coordination of these correlates to en- 
vironmental contexts. Thus the issue 
raised originally reappears, the neces- 
sity for terms to represent or describe 
the context of behavior. That such co- 
ordination can or will be accomplished 
in the future is an empirical rather than 
a logical problem, and therefore not a 
logical certainty. 

Our discussion of reductionism up to 
this point has concerned itself largely 
with the logical problems inherent in the 
second and third essential aspects of 
the doctrine as outlined at the start of 
this paper. Proper attention to the first 
and fourth aspects would extend the 
paper beyond practical space limita- 
tions. Instead, we shall simply sketch 
some of the issue requiring attention. 

The first aspect has to do with the 
hierarchical ordering of the several sci- 
entific disciplines. Despite the wide- 
spread acceptance of the hierarchy no- 
tion—witness the frequent reference to 
“Jeyels” of science, the employment of 
terms like “basic” to contrast disci- 
Jines, and the characterization of cer- 
tain disciplines as “emergent” from 
others—it is not an analytically clear 
concept. Kroeber (17) remarks on the 
absence of any adequate attempt to ex- 
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amine systematically what the levels 
constitute or mean in terms of a theory 
of knowledge. Most of the discussions 
of what is meant by levels resolve into 
two positions which are generally con- 
sidered to exclude one another. One 
point of view conceives the levels of 
science to be a matter of methodology 
only, i.e., to refer to the kinds of pro- 
cedures employed by the various sci- 
ences, the size of their units of analysis, 
etc. The other point of view considers 
the levels to refer to substantive differ- 
ences in the events or phenomena dealt 
with by the various disciplines. Psy- 
chologists will be familiar with this con- 
trast from the Littman-Rosen (18) 
analysis of the molar-molecular prob- 
lem. A third possibility is that these 
two positions are correlated rather than 
mutually exclusive, namely, that sub- 
stantively different events require par- 
ticularly appropriate methodological pro- 
cedures for useful analysis. 
Each of these three views has certain 
implications for the doctrine of reduc- 
tionism. For example, the methodo- 
logical position would seem to favor the 
doctrine, since it assumes the events or 
phenomena to be the same and only the 
descriptions of them to be different. If. 
only the descriptions differ, e.g., in size 
of unit, it should be logically possible 
to reduce the larger units to their smaller 
constituents. On the other hand, to 
assert a substantive difference between 
levels would seem to be unfavorable to 
reductionism. The events or phenomena 
of higher levels are considered different 
from—not the same ones as—those of 
the lower levels, and therein lies the dif- 
ficulty in reducing descriptions of one 
kind of event to those of another kind. 
No vitalistic or dualistic considerations 
need be involved in speaking of events 
as different; certainly organic events 
may in general be separated from in- 
organic ones, for example. Feigl’s dou- 
ble-language theory (10) of the mind- 


body problem espouses the methodo- 
logical position in insisting that the fac- 
tual reference of the mentalistic, be- 
havioristic and neurophysiological lan- 
guages or levels of description is iden- 
tical, i.e., involves the very same events 
and processes. Others concerned with 
the problem seem to adopt the substan- 
tive position; thus White (25) speaks 
of culture as a distinct class of events, 
a distinct order of phenomena, Woodger 
tentatively concludes that “. . . perhaps, 
in spite of superficial appearances, 
person-acts and behaviour [defined in 
physical or physiological terms] are not 
quite the same things . . .” (26, p. 284). 
The position taken by the present writer 
in differentiating psychological from 
physiological terms may be seen as com- 
patible with the substantive view. What 
may account for substantive differences, 
whether the substantive vs. methodologi- 
cal distinction itself is useful or defensi- 
ble, the issues involved in emergentism, 
and any fuller analysis of the implica- 
tions of the hierarchy of sciences for re- 
ductionism must be deferred for discus- 
sion elsewhere. 

The final aspect of the reductionism 
doctrine has to do with the idea that 
causal explanation is advanced by the 
employment of terms of a lower-level 
discipline. In psychology, this notion 
identifies causal explanation with neuro- 
physiological reference. The key reason 
for this approach would seem to be the 
belief in the higher levels as simply de- 
rivable from, or applications of, terms 
and laws of lower disciplines. Once this 
idea is abandoned, causal explanation 
could Just as logically proceed upward 
to sociological and anthropological con- 
cepts. Knowledge of causality is prob- 
ably best divorced from the hierarchy 
of sciences notion and considered in- 
stead to vary with the scope of the net- 
work in which any concept is embedded: 

We have tried to examine some of the 
logical and empirical problems relate 
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to the doctrine of physiological reduc- 
tionism. Some of the considerations 
have led us to doubt that psychology 
can be reduced to physiology, though 
certainly many of the questions are of 
an empirical sort. Nothing thus far 
asserted in any way denies the possi- 
bility or the desirability of the unifica- 
tion of the sciences, the synthesis of psy- 
chology and physiology. What is ques- 
tioned is that such synthésis must pro- 
ceed by reduction, by one discipline de- 
vouring or incorporating the other. All 
of the sciences are developing, and their 
influence upon each other does not only 
proceed upward from physics. Berta- 
lanffy (3, 4), for example, sees certain 
biological developments, such as the 
notions of open systems, requiring ex- 
tensions of the conceptual system of 
physics. Unity of science for him refers 
only to the structural isomorphy of laws 
in the different fields of science, the ap- 
proach of General Systems Theory. 
Psychologists can also see the conceptual 
system of physiology as influenced by 
the data and concepts of psychology 
(11). Thus, unification of the sciences 
may proceed from above as well as from 
below. The continued autonomous de- 
velopment of each of the sciences will at 
least serve to specify the properties re- 
quired of any synthetic unifying scheme. 
This is probably part of what Brunswik 
had in mind when he wrote: “Insistence 
on reduction as a universal goal of sci- 
ence can only result in blighted spots on 
the landmap of scientific enterprise” (7, 


p. 237). 
SUMMARY 


This paper has had as its aim the in- 
stigation of renewed attention to the 
doctrine of reductionism, especially in 
terms of its implications for the rela- 
tionship of physiology and psychology- 
Despite the empirical character of the 
ultimate answer, it is asserted that the 
questions involved in the doctrine may 
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properly be the concern of a logical 
analysis. After briefly sketching four 
propositions which constitute the essen- 
tial notions of reductionism, the argu- 
ment focused upon the logical possi- 
bility of a complete translatability or 
derivability of the concepts and laws of 
psychology from those of physiology. 
The central contention was that the lat- 
ter, lacking terms to describe the be- 
havioral environment, was logically in- 
adequate as a base for a thoroughgoing 
reduction of the former. The remainder 
of the paper commented upon the rela- 
tionship of the doctrine to the idea of 
a hierarchical ordering of the sciences 
and to the possibility of achieving a uni- 
fication of science. 
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Hull (6, 7, 8) clearly distinguishes 
between a habit (sHr) and a drive 
state (D) and postulates that these 
interact in a multiplicative manner so 
as to generate a reaction potential 
(sEr). Thus: 


sEr = f(D X sHr) 


In his discussion of factors affecting 
response decrement, Hull further pos- 
tulates the development of two forms 
of inhibition: an innate, fatigue-pro- 
duced reactive inhibition (In) and con- 
ditioned inhibition (sIr). 

The former is clearly conceived of 
and defined as a negative drive state, 
while the latter is considered a nega- 
tive habit phenomenon. Despite this 
distinction, and in conflict with his 
earlier insistence on a multiplicative 
relationship between habitsand drives, 
Hull postulates that these two vari- 
ables interact in an additive fashion 
to produce inhibitory potential (Ír). 
When such an inhibitory potential is 
operative it is subtracted from reac- 
tion potential to give a resultant or 
net reaction potential (sĒr). Thus: 


sĒr =fl(D X sHr) — (In + sIz)] 


There has been considerable criti- 
cism of this formulation by suc 
writers as Gleitman et al. (5) and 
Koch (3). Osgood (10) and Zeaman 
(15) similarly recognize the inconsist- 
ency of the equation and propose that 


1This article is based upon a paper pre- 
sented to the XV International Congress © 
Psychology at Brussels in August 1957. The 
proposed equation was earlier made use of by 
Eysenck (4) in a theoretical analysis of 
“warm-up” phenomena. 


sIr, as a negativejhabit, be subtracted 


` directly from sHpr, a positive habit, 


while continuing to allow Ip to be 
subtracted directly from sZr._ Iwa- 
hara (9) recognizes that Ip, asa drive, 
must enter into some form of multi- 
plicative relationship, but, in order to 
avoid certain implications of his argu- 
ment, he then abandons sIpasa habit 
phenomenon and proposes to consider 
it only as a secondary drive. Such 
an abandonment of the concept of a 
negative habit would involve a radical 
revision of much existing theory, 
while its retention is not imcompati- 
ble with the achievement of internal 
consistency within the Hullian sys- 
tem. 

Iwahara bases his rejection upon 
only one possible modification of the 
basic equation, i.e., that inhibitory 
potential (Ír) be considered as the 
product rather than the sum of re- 
active and conditioned inhibition. 
Symbolically: 


İr = In X sIr 


He points out that this implies that 
the inhibitory potential would be zero 
when either Ir or slr is zero, a conse- 
quence clearly inconsistent with the 
known facts of learning. Conditioned 
inhibition, however, may, as a habit, 
also be energized by positive drives, 
e.g., When response decrement persists 
after long rest subsequent to massed 
practice. Similarly, reactive inhibi- 
tion may interact with positive habits 
to lower reaction potential when no 
conditioned inhibition is present, as 
at the commencement of massed prac- 
tice or extinction. 
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Thus it is suggested that net re- 
action potential may well be deter- 
mined by the multiplicative interac- 
tion of a resultant drive state (D — 
Ir), and a resultant habit state 
(sHr — sIr), so that: 


sĒr = J[(D — Ir) X (sHr — sIr)] 


The habit term in this equation is con- 
sistent with Osgood’s and Zeaman’s 
suggestion, while the drive term is 
consistent with Kimble’s (12) experi- 
mentally validated proposition that 
the amount of reactive inhibition a 
subject will tolerate is a function of 
his drivelevel. Thisequation is math- 
ematically equivalent to: 


sBr = fl(D X sHr) — (Ir X sHn) 
— (D X sIr) + (Ir X sIr)] 


the fourth term of which represents a 
paradoxical addition to positive re- 
action potential from the interaction 
of two inhibitory variables. Simi- 
larly, if both resultant drive and habit 
states are negative, the resultant re- 
action potential would be positive. 
Such a state of affairs would occur 
rarely in practice, but may furnish a 
theoretical explanation of such phe- 
nomena as the “ultraparadoxical” 
phase of inhibition observed by Pav- 
lov (11). The various inhibitory 
states of the organism induced by 
Pavlov in various ways may be con- 
sidered as inhibitory drive states akin 
to reactive inhibition. In advanced 
stages of such inhibition, Pavlov noted 
that positive conditioned stimuli 
tended to lose their effect, whereas 
“well developed negative stimuli” 
(i.e, when sIr was predominant) 
“acquired definite excitatory proper- 
ties.” 

Any amendment of the Hullian 
equation must serve, not only to make 
Hullian theory internally consistent, 
but also to generate theorems con- 
sistent with empirical data. One such 


sEa(o=2)(woui) 
X sEr (0=2) 
Er (Da1) 


REACTION 


DEVIATIONS FROM STIMULUS REINFORCED 


Fic. 1. Discrimination gradients at two 


levels of drive. See text for complete 
explanation. 


theorem, derived from the present 
amendment, refers to stimulus dis- 
crimination learning, around which 
an apparently adequate theory in 
Hullian terms has been developed, 
particularly by Spence (8, 14). 

The solid lines in Fig. 1 convention- 
ally illustrate the algebraic summation 
theory of discrimination,? in which net 
reaction potential (sĒz) is a result of 
the subtraction of generalized inhibi- 
tion (sIr) from generalized positive 
reaction potential (sEr). At the 
stage of learning illustrated, the net 
reaction potential attached to the 
nonreinforced or negative stimulus is 
well below the threshold for response, 
and the discrimination will be almost 
perfect. 


? The positive and negative generalization 
gradients in the figure were not calculated by 
formula but merely estimated, by eye, from 
textbook examples. Errors in reproduction and 
resulting differences between the two gradi- 
ents will account for the unconventional hump 
in the curves of net reaction potentials. 
These are irrelevant to the argument in this 
paper. 

*Hull claims that s#p cannot fall below 
sLr, the reaction threshold, because, when 
sEr = sLp, no further responses occur and so 
no further sZp develops. This argument 
neglects the effect of oscillation potential. 
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Let us now consider the effect of 
increasing drive in this situation; and, 
for convenience, let us assume that 
drive is, in fact, doubled. According 
to orthodox Hullian theory, this 
would have the effect of doubling the 
positive reaction potential but the 
conditioned inhibition would remain 
unchanged. The generalized net re- 
action potential (sĒr) would then be 
represented by the broken line in 
Fig. 1. The net potential attached to 
the nonreinforced stimulus is in fact 
higher than at the commencement of 
discrimination learning in the low 
drive situation: the discrimination 
would have broken down completely. 
Even with lesser increases of drive, 
its multiplicative relationship with 
habit strength would insure severe 
impairment of discrimination. 

If, however, the amended equation 
is employed, the doubled drive would 
have the effect of doubling both posi- 
tive and negative potentials. The 
generalized net potential would then 
be represented by the dotted line in 
Fig. 1 and, at any point along the 
stimulus dimension, would be double 
that in the low drive conditions. At 
the nonreinforced stimulus value this 
would involve only a slight increment 
in potential, as the initial value is al- 
ready subthreshold. Discrimination, 
while somewhat impaired, would not 
be expected to break down completely. 
_ Empirical evidence is clearly in 
favor of the prediction from the 
“amended equation. Skinner (13), for 

example, reports that under low hun- 
ger drive rats achieve almost perfect 
discrimination, performing bar-press- 
ing responses when the box is dark 
and not pressing when the box 1s 
lighted. Under increased food de- 
When sĒr = sLn, owing to oscillation, tte 


probability of response evocation wil 
Only when sip falls appreciably below slr 


will this probability fall to a low level. 
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privation the discriminations do not 
break down, but become impaired in 
that there is some “leaking” of posi- 
tive responses to the negative stimu- 
lus. 

Another situation in which the 
alternative equations would yield 
different predictions as to the effects 
of changes in drive is that in which 
habit strength is equal to conditioned 
inhibition. Calvin et al. (1) carried 
out an experiment to show that, de- 
spite continued reinforcement and 
under conditions of repeated massed 
practice, rats will ultimately refuse 
to traverse a runway for food and that 
this inhibition is retained over lengthy 
periods of time without such practice. 
Calvin's original experiment has been 
severely criticized by Dinsmoor (2) 
but, in a recent unpublished study, 
D. C. Kendrick achieved essentially 
similar results, using thirst motivation 
and controlling the factors relevant to 
Dinsmoor'’s criticisms. 

Jf animals, trained in this fashion, 
are retested under conditions of in- 
creased drive, Hull’s equation indi- 
cated that a reinstatement of the 
running {performance should occur, 
whereas under decreased drive the in- 
hibition should be more effective than 
in the original condition. According 
to the amended equation, change of 
drive in either direction, in that it 
affects siz and slz equally, should 
have no effect on the balance between 
these two variables. Any change of 
drive, in that it alters the stimulus 
situation, may affect performance, but 
running should be no more frequent 
with increased than with decreased 
drive. In a pilot study, employing 
ten animals and testing this predic- 
tion, Kendrick found that three ani- 
mals responded positively under in- 
creased drive. Two of these and 
one other responded under decreased 
drive. No responses occurred when 
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drive level was unchanged. A more 
rigorous follow-up of this finding is 
now in progress. 


SUMMARY 


An amendment of the Hullian 
equation is proposed which, for both 
excitatory and inhibitory potentials, 
is consistent with Hull’s distinction 
between habit and drive states and 
the postulated multiplicative inter- 
action between the two. Theorems 
derived from this equation are com- 
pared with the analagous consequences 
of Hull’s formulation in the light of 
available empirical data. 
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Xnformation theory, as developed 
narily -by Shannon (8), has had 
considerable impact on psychological 
“research. Information theory has 
been used as a model for many differ- 
ent types of behavior; it has also pro- 
vided psychologists with a statistical 
measure which has many useful prop- 
erties. Frequently the measure is 
used where the term information is 
somewhat inappropriate, and for this 
| reason Garner and McGill (3) have 
suggested that the term uncertainty 

be applied to the measure to divorce 

its statistical properties from the con- 

tent implications of the term informa- 

tion. Even when the mathematical 

properties of the uncertainty measure 

lead to implications for behavior situ- 

ation theory does not 


ations, inform 
Ecessarily define the problem ; rather, 
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the mathematical properties of the un- 
certainty measure simply make cer- 
tain relations clear which other statis- 
tical techniques have not been able to 
do. The purpose of this paper is to 
develop some equations of this sort 
involved in uncertainty analysis, and 
to point out the relevance of these 
equations to a few psychological 
problems. 

Two valuable properties of the 
measure of uncertainty, as pointed 
out by Garner and McGill (3), are: 
(a) that it is a nonparametric measure 
which can be applied to any set of 
categorized data, and (b) that the un- 
certainty of a variable can be parti- 
tioned into component parts, much as 
variance is partitioned in analysis of 
variance. A third useful property of 
the uncertainty measure is that an un- 
certainty analysis can be carried out 
in a completely symmetric form, in 
which it is unnecessary to distinguish 
between meanings or functions of 
the several variables involved in the 
analysis. This last property derives 
in part from the nonparametric nature 
of the measure, and is not a property 
of analysis of variance, in which differ- 
ent properties of the criterion and 

ictor variables are assumed. 
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ANALYSIS OF THREE VARIABLES 


For illustration purposes, let us con- 
sider first a three-dimensional matrix 
of data. Figure 1 shows schematic- 
ally such a data array and some of 
the terms which we will be using. 
The three variables are w, x, and y, 
and they can assume values w;, x;, and 
Yı Each cell in the matrix originally 
contains a number which represents 
the frequency of occurrence of cases 
having that particular combination of 
values of w, x, and y. For our pur- 
poses, we have shown the matrix in 
which these frequencies have been 
transformed into proportions, p (i,j,k), 
by dividing the cell frequencies by the 
total number of cases in the matrix. 
This matrix can be collapsed across 
any one variable to form three differ- 
ent two-variable matrices, or it can be 
collapsed across two variables to give 
distributions for one variable ata 
time. When two-variable matrices 


pli) 


Fic. 1. Symbolic 
three-variable matrix 
ables, w, x, Y, have cat 
yx. Each cell in the 
frequency which is dividi 
ber of cases in the matri 


representation of a 
of data, 


Surfaces, 
gives the 
and p(k). 


are used, the proportions in the cells 


are still computed as the frequency in _ 


a cell divided by the total number of 
cases, and can be obtained by sum- 
ming the proportions across the col- 
lapsed variable. 

Unidirectional analysis. When un- 
certainty is partitioned as in analysis 
of variance (3, 4), we start by identify- 
ing one of the variables, let us say y, 
as the criterion variable and the other 
two, w and x, as predictor variables. 


Now our problem is to partition the | 


uncertainty of y into its component 
parts. The basic partitioning equa- 
tion is 


UO) = UG:wx) + Vel) 1] 


where U(y) is the uncertainty of tite W 
y-distribution, based on p(k); U(y: 


w,x) is the total uncertainty in y 
which is predictable from w and x, 
and it is called a multiple contingent 
uncertainty; and U,2(y) is the residual 
or error uncertainty.2. This last term 
is a conditional uncertainty, and is 
computed by taking one combination 
of w and x at a time, determining the 
uncertainty of the y distribution, and 
then obtaining a weighted average 
for all of the w,x cells. 

By transposing terms in Equation 
1, we can define the multiple contin- 
gent uncertainty as 


U(y:w,x) = U(y) — Uw2(y) 


The right hand side of this equation’ 
can then be rewritten to show thé 
complete partitioning of the predict- 
able uncertainty as 


U(y:w,x) = U(y:w) J 
+ U(y:x) + U(y:w) [2a 


2 For many of the equations presented hen 
we shall not present calculational formulas in 
complete proofs, since these are available 
at least a related form in other articles 
pecially 2, 3, 4, 5), 
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The three terms on the right of Equa- 
tion 2a represent, in order, the uncer- 
tainty in y which is predictable from 
w, that which is predictable from Si 
and that which is preditable from 
unique combinations of wand x. The 
first two terms are simple contingent 
uncertainties, and are normally com- 
puted from collapsed two-variable 
matrices. The last term is an inter- 
action uncertainty, and is computed 
either as a residual or as the difference 
between a simple contingent uncer- 
tainty and the same contingent un- 
certainty computed with the third 
variable held constant, rather than 
with the matrix collapsed over that 
variable. (See 3, 4.) 

The partitioned uncertainties on 
the right of Equation 2a are com- 
pletely analogous to partitioned vari- 
ances in analysis of variance, and 
would in that case be called the main 
effect due to w, the main effect due to 
x, and the interaction between w and 
iC. 

Symmetric analysis. This approach 
to uncertainty analysis has the ad- 
vantage of being completely analogous 
to the more familiar analysis of vari- 
ance. It is possible, however, to ap- 
proach uncertainty analysis from an- 
other point of view which allows us to 
deal with the total matrix of data as a 
unit. We can call this type a sym- 
metric uncertainty analysis to dis- 
tinguish it from the unidirectional 
analysis in which the uncertainty of 
just one variable is partitioned. 

Since uncertainty analysis is non- 
metric, we can look at the three-di- 
mensional matrix shown in Fig. 1 as a 


e frequency distribution in which 


singl ortions of 


we have categories and prop 
total cases falling in each category. 
Thus we can compute the uncertainty 
of the total matrix of data, 


=— s{oG3,)J 


U(w,x,9) x [logpii.4)1} [3] 
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where U(w,x,7) is the total uncer- 
tainty in the three-dimensional ma- 
trix, and (i,j,k) is as defined above. 
Now this total uncertainty can be 
broken down in many different ways, 
some of them being 


U(w,x,y) 


= U(y) + Uy(%) + Uw) [4a] 
= U(w) + Uwlx) + Une(y) [4b] 
= U(w,x) + Uuz(y) [4c] 
= U(w,y) S Urey (x) [4d] 
= U(x,y) + Uz,(w) [4e] 


For these equations, the only terms 

needing additional explanation are 
U, (x) or equivalent forms, and U(w,x) 

or equivalent forms. Any term of the 
form U,(x) is a conditional uncer- 
tainty, and is computed by taking one 
value at a time of the subscripted 
variable, determining the uncertainty 
of the variable in parentheses, then 
obtaining a weighted average over the 
subscripted variable. For such a cal- 

culation, the matrix is collapsed over 

the third variable. A term of the 

form U(w,«) is a total uncertainty for 

a two-dimensional matrix, i.e., one 

collapsed over the third variable. It 

is computed from the (i,j), in this 

case, or the appropriate terms from 

any other combination of two vari- 

ables. 

U(w,x,y) is a measure of the total 

uncertainty actually obtained in the 

matrix. We can also determine the 

maximum total uncertainty which ` 
could have been obtained from the 

matrix by determining what propor- 

tion of cases would fall in each cell if 

there were no contingencies between 


any of the variables. This ideal 
proportion, P (i,j,k), is 
PIR) = ppp) [5] 


in much the same way as is done for 
chi square. From these ideal propor- 
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tions, representing those which would 
be obtained in a matrix with all vari- 
ables completely independent, we can 
compute the maximum uncertainty 
in the matrix, Umax(w,x,y), as 


Uinax(w,x,9) = — 2{[p Wo) P(k)] 
X Bogt [6] 


This maximum uncertainty can itself 
be partitioned into components, and in 
this case the partitioning results in a 
simple set of terms. 


Umax (w,%,y) = U(w) 
+ U(x) + Uy) [7] 


In other words, the maximum uncer- 
tainty which can be obtained in a 
matrix which is restricted only by the 
marginal totals is the sum of the un- 
certainties for the three variables 
taken one at a time. 

Now if we have the maximum un- 
certainty which can be obtained in a 
matrix, and the uncertainty which 
actually is obtained, the difference 
between them must give us a measure 
of the total contingency or interre- 
latedness in the three-dimensijonal 
matrix. We can define a measure of 
this total contingency as 


U(w:xry) = Umax (w,x,y) 
— Uwx,y) [8] 


The term on the left is the total con- 
tingent uncertainty in the matrix, and 
the colon notation has been used 
as for other contingent uncertainty 
terms. It should be noted, however, 
that there is no implication of direc- 
tion in the notation; the three sym- 
bols for the three variables can be 
written in any order, 

The total contingent uncertainty 
can be partitioned also, and the use 
of Equation 7 with Equations Ac, 
4d, or 4e leads to three different forms 
of the partitioning. 
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U(w:x:y) 
= U(w:x) + U(y:w,x) [9a] 
= U(w:y) + U(x: w,y) [9b] 
= U(x:y) + U(w:x,y) [9c] 


All terms in these equations have been 
defined above, and need no further 
clarification here. These equations 
show that the total contingent uncer- 
tainty can be partitioned into a multi- 
ple contingent uncertainty in which 
one variable is predicted from the 
other two, plus a simple contingent 
uncertainty between the two vari- 
ables used as predictors, and it makes 
no difference which variable is treated 
as a criterion and which as predictors 
for this statement to be true. 

As in Equation 2a, each of the three 
multiple contingent uncertainties can 
itself be partitioned into components, 
so that the total contingent uncer- 


tainty can be partitioned more com- 
pletely. 


U(wix:y) = U(w:x) + U(w:y) 
+ U(e:y) + U(y:wx) [10a] 


= U(w:x) + U(w:y) 
+ Ue:y) + Ulery) [10b] 


= U(w:x) + U(w:y) 

+ Ulery) + U(w:x%y) [10c] 
Thus the total contingent uncertainty 
consists of the three simple contingent 
uncertainties involving the three Pairs 
of predictors, plus the interaction 
term. Although the three interaction 
terms have been written differently 
for each of the above equations, it is 
clear from these equations that all 
three interaction uncertainties are 
equal. I n other words, there exists in 
the matrix all of the two-variable con- 
tingencies, plus a single interaction 
term which truly represents a three- 
dimensional pattern of inconsistencies. 
For this reason, it would be better to 
write the interaction uncertainty as 
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U (wxy) to make clear the uniqueness 
of that term for the given matrix. 

We can transpose the terms in 
Equation 9 to see the relation of each 
of the three possible multiple contin- 


gent uncertainties to the total con- ` 


tingent uncertainty, and we obtain 


U(y:w,x) = U(w:x:y) 

— U(w:x) [ila] 
U(x:w,y) = U(w:x:y) 

— U(w:y) [11b] 
U(w:x,y) = U(w:x:y) 


— U(«:y) [tic] 


With the equations written in this 
form, some interesting relations be- 
tween variables become clear. Each 
of these equations shows the multiple 
contingent uncertainty for predicting 
one of the three variables from the 
other two. The form of the equations 
is such that the amount of prediction 
available for each variable is equal to 
a constant less a variable term, which 
is in each case the contingent uncer- 
tainty between the two predictor vari- 
ables. In other words, the amount of 
predictable uncertainty available for 
any variable in the matrix is the total 
contingent uncertainty less the con- 
tingent uncertainty between the two 
variables from which the predicting is 
being done. Therefore, the amount 
of prediction available in any direction 
in the matrix is inversely related to 
the contingency between the two pre- 
dictors. For a given matrix the max- 
imum possible predictability will be 
obtained when the predictor variables 
are orthogonal, i.e., have no contingent 
uncertainty between them. This 
seems intuitively sensible, and the 
uncertainty analysis presented here 
makes it clear that for any set of data 
this relation must hold. 

In order to compute the exact 
amount of a multiple contingent un- 
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certainty involving one criterion and 
two predictor variables, it is necessary 
to carry out calculations which in- 
volve the three-dimensional matrix. 
For many sets of data, such a calcula- 
tion would be extremely laborious, if 
not prohibitive. Equation 11 shows 
that it is possible to determine how 
much better prediction is for one 
variable than for another without 
actually carrying out a complete three- 
dimensional calculation. Theamount 
by which one of the three multiple 


contingent uncertainties is greater 


than another is exactly the same as the 
difference in the contingent uncer- 
tainties between the two sets of pre- 
dictor variables, and these uncer- 
tainties can be calculated with the 
three-dimensional matrix collapsed 
to two dimensions. Actually, the 
interaction term is the one which re- 
quires a three-dimensional calculation 
to determine all the components of the 
multiple contingent uncertainty (see 
Equation 2a), and the essential fact 
which makes it possible to determine 
exact differences between multiple 
contingent uncertainties without the 
complete calculation is that the inter- 
uncertainty, as mentioned 
is the same for all three cri- 
terion variables. It is truly a term 
which applies to the complete matrix 
of data, and is not uniquely deter- 
mined for each variable used as a 
criterion, as it would appear to be from 
a unidirectional partitioning approach 
to uncertainty analysis. 


action 
above, 


HIGHER-ORDER MATRICES 


The entire discussion so far has 
been concerned with three-dimen- 
sional matrices. It is necessary to 
use at least three to show the nature 
of the principles, and it is easier to use 
just three to avoid undue complica- 
tion. However, a brief discussion of 
at least the four-variable case will 
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illustrate “some additional factors. 
We noted above that it is necessary 
to have two-dimensional uncertainty 
calculations in order to determine 
the differences between the various 
three-dimensional, multiple contin- 
gent uncertainties. In other words, 
calculations with as many variables as 
there are in the matrix are necessary 
to obtain absolute numbers for the 

- multiple contingent uncertainties, but 
calculations with one less variable 
suffice to determine the differences 
between the multiple terms. A simi- 
lar situation exists for the general case, 
as is shown by the following equations, 
in which v is the fourth variable: 


U(y:0,w,x) = Uw: w:x:y) 

— UQv:w:x) [12a] 
U(x:v,w,y) = Uv: wix:y) 

— U(v:w:y) [12b] 
U(w:v,x,y) = U(v:w:x:y) 


— U(v:x:y) [12c] 
U(v:w,x,y) = U(v:w:%:y) 


— U(w:x:y) [12d] 
Each of the four 


of course, 
as four or more 
Is impractical 
te interaction 
aree variables, 
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We noted above that when only three 
variables are: involved, exact differ- 
ences between the multiples can be 
determined because there is only one 
interaction term for the entire matrix. 
When four or more variables are in- 
volved, however, the interaction terms 
involved in each multiple contingent 
uncertainty are not identical. For 
example, with four variables, each 
multiple contingency requires four 
interaction terms, three involving 
three terms, and one involving all 
four terms. For any pair of multiple 
contingent uncertainties, three of 
these four interaction terms are identi- 
cal, but the fourth will normally be 
different. As an example, the com- 
plete set of terms involved in two of 
the four multiple contingent uncer- 
tainties are 
U(y:0,0,x) = U(y:v) + U(y:w) 
+ UG:x) + Uy) + U(x) 

+ Uwy) + U (axy) [13a] 

U(x:v,w,y) = U(x:v) + U(x: w) 
+ Uy) + Ua) 4 U (ay) 
+ U(wxy) + U (xy) 


[13b] 
where U (vwy) Is the four-variable 
interaction, Notice that only the 


and U(70y) 


uncertainty, a. 


in estimating the absolute magnitude 


- 
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is likely in that case. When even 
more variables are included, the ap- 
proximation by adding up the two- 
variable contingent uncertainties is 
more risky, since many more inter- 
action terms are involved. However, 
it should be remembered (3, 4, 5) that 
interaction terms can take negative as 
well as postive values, so that it is 
often still reasonable to assume that 
the sum of all the interaction terms 
is negligible, even though any one of 
them may not be. 


PSYCHOLOGICAL APPLICATIONS 


The mathematical developments 
briefly described in the preceding 
sections have many interesting appli- 
cations and implications in psycho- 
logical research. In some cases, their 
primary value is to clarify the mean- 
ing of computations which are com- 
monly carried out with the uncertainty 
measure. In other cases, these equa- 
tions lead to some quite specific pre- 
dictions about behavior which is re- 
lated to nonrandom sequential events. 
A few of these implications will be 
discussed as illustrative cases. 


Multivariate Information Transmission 


One common application of infor- 
mation theory in psychology has been 
the use of information-transmission 
measures to determine the “channel 
capacity” of the human observer 1n 
transmitting, verbally or otherwise, 
information about a series of events or 
stimuli. Following the paradigm de- 
scribed by Garner and Hake (2), a 
typical experiment requires subjects 
to make an absolute response to @ 
series of stimuli varying along some 
dimension, €-8-, brightness or loud- 
ness, and the experimental result is 
expressed as an amount of informa- 
tion transmitted between stimuli and 
responses. If the information trans- 
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mitted is less than the information 
contained in the stimuli and in the re- 
sponses, it is assumed to represent a 
channel capacity for the observer. In 
the more general terminology of the 
present paper, information transmis- 
sion is a two-variable contingent un- 
certainty. 

Although many experiments have 
been done using this approach, for 
illustrative purposes we will refer to . 
one by Garner (1). In this experi- 
ment, observers were asked to make 
an identifying response on a twenty- 
point scale to twenty different inten- 
sities of atone. From the two-dimen- 
sional matrix involving just stimuliand 
responses, the contingent uncertainty 
between stimuli and responses is com- 
puted, and this figure can be inter- 
preted as information transmitted. 
For this two-variable case, it makes 
no difference whether we state that 
information has been transmitted 
from stimuli to reponses or vice versa, 
or whether we talk about the amount 
of uncertainty in stimuli which can 
be predicted from responses, or vice 
versa. The measure of information 
transmission is completely bilateral, 
and can be given interpretation in 
either direction. 

Additional computations were made 
in this experiment, however, to allow 
multiple predictions. For example, 
the multiple contingent uncertainty 
for predicting the response from knowl- 
edge of the observer and of the stimu- 
lus was calculated, and this multiple 
contingent uncertainty was, naturally, 
larger than the simple contingent un- 
certainty. It seems quite reasonable 
that if more variables are isolated for 
prediction purposes, more prediction 
should be available. 

However, Equation 11 shows that, 
once more than one variable is used 
as a predictor, the bilateral interpre- 
tation of the contingent uncertainty 
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is no longer possible. Let us look at 
this over-all situation as a three-vari- 
able matrix involving stimuli (S), 
responses (R), and observers (0). 
Now the multiple contingent uncer- 
tainty for predicting R from S and oO 
is the total contingent uncertainty in 
the matrix minus the contingent un- 
certainty between S and O, the 
two predictor variables. In symbolic 
form, 


U(R:S,0) = U(R:S:0) 
= U(S:0) [14a] 


In this experiment, however, as in 
most such experiments, all observers 
received the same simuli. Therefore, 
there is orthogonality between S and 
O, and the contingent uncertainty 
between them will be zero, Thus 
the contingent uncertainty available 
for predicting R is the same as the 
total contingent uncertainty in the 
three-dimensional matrix, and is at 
a maximum for this particular matrix, 
Suppose now that we decide to 
predict stimuli from responses and 
observers—the type of prediction 
which is more realistic, since in the 
ordinary situation the response is 
transmitted, and from it and knowl- 
edge of the observer we make infer- 
ences or predictions about the stimu- 
lus. In this case, in symbolic form, 


U(S:R,O) = U(R:S:0) 
= U(R:0) [14b] 


Again the multiple contingent uncer- 
tainty is the total contingent uncer- 
tainty in the matrix minus the con- 
tingent uncertainty between the two 
predictors, Rand O. But in this case 
there is no reason why R and O should 
be orthogonal, and, in fact, jf they are, 
there is little point to adding O as 
a predictor variable. Since y (R:0) 
will be real and positive, prediction of 
S from R and O must be less than pre- 


diction of R from S and O, as in the 
former case. 

These relations do not indicate that 
no improvement in prediction of S can 
be obtained by making use of knowl- 
edge about O, but only that prediction 
of S cannot be as great as prediction 
of R. If we consider only the predic- 
tion of S, then prediction from both R 
and from O can be broken down as 
follows: 


U(S:R,0O) = U(S:R) + U(S:0) 
+ U(SRO) [15] 


The third term on the right is the 
interaction. This equation shows the 
problem clearly, since no prediction of 
Scan be obtained from just O as long 
as S and O are orthogonal. Addi- 
tional prediction may, however, be 
obtained from the interaction term, 
which is to say that only inconsisten- 
cies in relations involving observers 
are useful in Predicting S$. For pre- 
dicting R, however, both inconsistent 
and systematic differences by O in use 
of the R continuum Provide additional 
prediction. 


A simple example will 
these relations, 


» but in reverse relation 
to the two stimuli. Knowing the 


stimulus alone is now of no value in 
predicting the reponse, or vice versa. 


« 


eo 


= 
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However, if knowledge of the observer 
is added, then perfect prediction of 
the response from the stimulus or of 
the stimulus from the response is 
possible, due to the fact that the 
addition of observers as a predictor 
variable adds an interaction term in 
which all of the available prediction 
lies. 

As a concrete example, these two 
cases are extreme and serve only to 
illustrate the principles. One fact 
remains, however, which is that, in 
such a case, prediction of stimuli can 
never be better than prediction of re- 
sponses, and will normally be some- 
what poorer. 

This type of problem can be carried 
even further. In the same experi- 
ment, Garner actually determined the 
multiple contingent uncertainty when 
three predictor variables were used, 
the fourth variable being the stimulus 
which preceded the one being judged 
(P). Furthermore, in the experi- 
mental design, all pairs of stimuli were 
made to occur equally often, so that 
P and S were orthogonally related. 
Now the two types of prediction are 


U(R:S,0,P) = U(R:S:0:P) 

— U(S:0:P) [16a] 
and 
TEROP) = U(R:S:0:P) 

— U(R:0:P) [16b] 


following the form of Equation 12. 
In this case, when R is predicted from 
S, O, and P, the multiple contingent 
uncertainty is the total contingent 
uncertainty in the four-variable ma- 
trix minus the total contingent uncer- 
tainty in the three-variable matrix in- 
volving the three predictor variables. 
Since these three predictor variables 
were all orthogonal in the epenn 
tal design, this term is zero, and this 
multiple contingent uncertainty CON- 


tains all of the contingent uncertainty 
available in the matrix. 

When, however, S is predicted from 
the other three variables, the orthog- 
onal condition does not hold, since 
of the three variables only O and P 
are orthogonal. This term will thus 
be positive, and will be fairly sub- 
stantial, since it includes contingent 
uncertainties between R and O, and 
between R and P, as well as the three- 
variable interaction which must (in 
this case) also be positive. Thus, 
there would be substantial error in 
assuming that these two multiple 
contingent uncertainties are the 


same. 


Redundancy of Printed English 


One common application of infor- 
mation theory and uncertainty analysis 
is to situations where events occur in 
a sequence which is not completely 
random, i.e., in which there are se- 
quential dependencies. Much of hu- 
man behavioris concerned with making 
predictions about events which are 
unknown, or only partially known, on 
the basis of sequentially provided in- 
formation. Language is a very good 
example of such an event series in 
which there exist sequential depend- 
encies, and several studies have been 
done on printed English to determine 
how much predictability is available 
from sequences of letters. The term 
“redundancy” has been used to de- 
scribe the amount of uncertainty in a 
given letter which is predictable on the 
basis of other known letters. The 
equations described above point out 
some interesting aspects of redundancy 
of printed English. 

When we do an analysis of se- 
quences of letters, we form a data 
matrix in which the series of letters 
itself is one variable, and the series 
displaced by one or more units consti- 
tute the other variables. For present 
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purposes we shall refer to the first 
order series as variable w, the series 
displaced one step as variable v, and 
the series displaced two steps as vari- 
able y, and we will be dealing with 
a three-dimensional matrix. In this 
special three-variable case there are 
some restrictions on the contingent 
uncertainties which exist. For ex- 


ample, we know that the term U (wx) 
is identical to the term U(«:y), since 
in both cases we ate describing the 
contingent uncertainty for a one-step 
displacement of the series. In ad- 
dition, the contingent uncertainty 
U (w: y) is less than either of the others, 
since that is the contingency for a 
two-step displacement, and a letter 
cannot be predicted as well from one 


twa steps behind as from the immedi- 
No we Know trom Equation d 
fea For PEGS, Cetceent Ss ne 
etter of a series of three, must be 
greater than that for predicting either 
wory from the other two, since when 
x is being predicted, the contingent 
uncertainty between the two predictor 
variables, w and X, is less than the 
contingent uncertainties beliren the 
other pairs of predictor variables. In 
other words, prediction of the middle 
letter of a Sequence of three must be 
better than prediction of either end of 
the sequence on the basis of the other 
two. Similar relations hold for longer 
sequences of letters, and we ean make 
the general statement that if any 
fixed number of letters js available for 
prediction, the best prediction will 
occur if the letter being predicted is in 
the middle of the total sequence rather 
that at either end. Miller and Fried- 
man (6) have experimentally shown 
His t9 be true for sequences gf 5, 1, 


f 
and d4 letters. : 

Newman and Gerstman (7) have 
provided sate whieh alow uy ip axe 


just how much better prediction of 
middle letters should be, compared to 
prediction of end letters. Their data 
show, for the three-letter case, that 
the contingent uncertainty for a one- 
step displacement is 0.91 bit, and for 
a two-step displacement it is 0.42 bit. 
This difference is 0.49 bit, and refer- 
ence to Equation 11 shows thot this is 


the exact amount hy which prediction 
of the middle of three letters will be 
better than prediction of cither end. 


This is a substantial difference. 


veciiverd, (if, MOENIE eanticd 
on oth a 

BEY Gy a& 
S Pep Tani 


volve 


1 ple contingent 
Using again the New- 


man and Gerstman 
eleven-letter Sequence, 


uncertainty for predicting either end 


letter on the basi 9f the other ten is 
e that 


data, for an 


the contingent 


414 bita, while + 


middle letter, or is Medleting the 


ì the basis f 
letters on each side, ts 4.45 He ie 
seven-letter sequences, prediction of 


the end letters gives 4 tiultiple contin- 
Sent. uncertainty ef 1.95 bits, while 
itn ¿$ the Middle lettet gives 
at its. hese figures som par 
ae Wl with thio foc by Miller 
ai ‘riedman, althoy h the agree- 
i ië better for midd e letters than 
that humane ein from their data, 
ioe ERENS Make ectiv 
af indore m K€ More effective use 


PEN! presented symmetri- 
cally than they de of unilateral infor- 


maah HNEE their data show highor 


residual uncertaint; 
these ealeulations arene hath 


These telations Fat But ihe Head 


e — a — pee gi i paint 
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for a more accurate interpretation of 
the meaning of the term redundancy, 
This term has usually been defined for 
prediction of letters on the basis of 
preceding letters only, and the figure 
of 50 per cent redundancy is accepted 
as a reasonable approximation. Such 
a figure, however, can be interpreted 
quantitatively and accurately only 
when it is applied to this particular 
prediction situation. We afinat lafat 
from it how many letters in printed 
English can be omitted and still alloy 
perfect reconstruction. Clearly, with 
A igure Of 50 per cent redundancy for 
one direction only, we cannot expect 
perfect reproducibility unless we have 
this amount available on both sides 
of the item being predicted, In other 
agree am ier 100 per cent redune 
Saney jerfect prediction; sit Has 
damnes Ub PRMICGC Dibcroshee c oy PBs peg 
BP de ‘obtained only Whee g slagje 
deitar 36 Seed ZEN SY | SRE SE 
(uence, a0 that nantes RES 
is available on both sides of the letter 
being predicted. Perhaps our think- 
ing on this problem has been limited 
by our natural tendency to deal with 
antecedent and consequent events; 
but we çan think of both antecedent 
and subsequent events as determining 
the consequent VERN: and combina- 
tiong of these "certainly are mere 


effective than either alone. 


Response to a Continuing Series 


A type of behavior for which the re 
lations shown here have important 
implications is that in which a contin- 
ulng series of events occurs and a hu- 
man must make a differential Te 
sponse to each event as it occurs: 
Buch tasks a8 Sopying telegraphic 
code, taking dictation, reporting tar- 
get pgsitians 6H a tadar; and target 
Fey ae a O tn 
Te talk about a simpli Fal tas! 
thie sort; ef we ase tat an apera- 
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tor is read a series of numbers and he 
is required to push a different button 
for each different number he hears, 
much as he would if he were adding 
up a column of figures on a machine 
as they were being read to him. ‘The 
operator, of course, is trying to carry 
out his task with minimum error, so 
that he wants to get all the informa- 
tion possible about what number was 
road 10 him, Initially, Jet us suppose 
also that the numbers do not occur in 
a completely random order, but that 
cach number i8 to goie extent pre: 
dictable from the preceding ones ina 
falely simple way: Such # series) <: 
course, will show sequenti depend: 
encies, and, with rare exceptions, the 
contingent uncertainties between the 
series and its displacements will be 
teas es ri 
CE ace 
SEG fo DAM ME IE ano 
Bate MeSH {estii BS x : 
in thie paper that tie matii 
dictability for any single number will 
be obtained if the operator is able to 
use clin ome edane t ie, 
to uge both forward and backware pre 

orward 


diction: In order to use ot! 
however, 


and backward prediction, e 
he must delay his respoiide t0 A parti: 
ular number until he has heard some 


subsequent numbers: He musty in 
other wokas fa Of course, phenom 
series, This 18, 0 f nom- 
enon aoraianly observed in everyday 


life, 4 

Suppose we want to determine the 
optimum lag which such an operator 
should use. We can make some rea- 
sonable predictions, but these pre- 
dictions will depend on a number of 

actors. 

{, The memory function, tis clear 
that the operator cannot delay his rê- 
sponse 86 long that he hi Het remem- 
her All of the items whisk are paten? 


tially useful for prediction of any one 
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TABLE 1 
ILLUSTRATION OF EFFECT OF MEMORY AND CONTINGENT UNCERTAINTIES 
on Optimum LAG 
Lag of 2 Lag of 3 Lag of 1 
Init i ¢ . . 
Ont Step | Speers Weight | Weighted | Weight Weighted | wetgue Welghtea 
2 06 
= 30 5 
a -50 2 -10 4 -20 
-1 1.00 A 40 2 20 $ 60 
6 4 F 
a 1.00 8 -80 6 -60 1.0 1.00 
+2 50 1.0 50 8 40 
+3 30 1.0 30 
= weighted values 1.80 1.50 1.86 


item. Let us assume that a reason- 
able memory span for such a task is 
five units. Now if all five units can 
be remembered equally well, the 
optimum lag for the subject would be 
two units behind the one to which the 
response is being made, since this lag 
will give two units on either side of 
the one in question, with the optimum 
of completely symmetrical prediction, 

Actually, it is unrealistic to assume 
that all of the items can be remem- 
bered equally well, It seems more 
reasonable to assume that the items 
last heard will be more clearly remem- 
bered, and thus that the contingent 
uncertainty available from them will 
be more useful. We can illustrate the 
effects of such a memory function by 
using a weighting function which gives 
greater value to recent items than to 
remote items. Table 1 contains a set 
of purely hypothetical numbers to 
i memory factor 
would affect the optimum lag. The 
first column shows the unit step for 
each item, with “OQ” being the item to 
which the response is to be made, 
negative numbers referring to items 
which occurred before that one, and 
positive numbers being the later, more 
recent items. Column 2 shows the 


contingent uncertainty for each dis- 
Placement with respect to the “O” 
item. Colum 


which have been assigned to each į 
being used for i 


With the largest weight for the most 


tainty occurs With a Ig 
has a value of 1.86 pb; 
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what less than that which gives com- 
plete symmetry to the predictors. 

2. The amount of redundancy. A 
second factor which affects the opti- 
mum lag is the total amount of redun- 
dancy in the series, as well as its 
distribution over the various steps. 
This point need not be labored. A 
lag which gains no predictability can 
be of no value. Clearly if the series is 
completely random (no redundancy), 
there is no advantage to the operator 
in lagging because he gains no redun- 
dancy or information when he does. 
Similarly, if only one item on either 
side gives any predictability, then no 
lag beyond one step is ever of any 
value. 

3. The amount of noise. One last 
factor which will affect the optimum 
lag for the operator is the amount of 
noise whose effects have to be offset. 
In our illustration, the term noise can 
refer to acoustical noise, or in the more 
an refer to any factor 
which decreases the probability of a 
correct perception of the item. The 
importance of noise is that it deter- 
mines the extent to which the operator 
needs the predictability which he can 
obtain from the series redundancy. 
Suppose, for example, that the task is 
being carried out under ideal circum- 
stances, where each number can be 
heard quite clearly. Under these cir- 
cumstances, the operator can respon 
immediately with essentially perfect 
accuracy, and it would make little 
difference to him whether the series 1S 
partially redundant or not. In either 
case he would not lag, since there is 
nothing to be gained by it. If, on 
the other hand, there is: sufficient 
noise to make the perception imper- 
fect, then the operator will frequently 
find himself unsure of the proper re- 
will need some additional 
to improve the probabil- 
rect response. 


general case, it c 


sponse, and 


redundancy t¢ 
ity of his making a cor 


Thus, in summary, we can assume 
that with this type of task operators 
will lag if there is some noise in the 
situation and if the series has sequen- 
tial dependencies in it. If these con- 
ditions are satisfied, then the exact 
amount of the lag will depend on his 
memory for the items presented and 
on the distribution of the redundancy 
in the series of events. 


SUMMARY 


In this paper we have presented 
some equations originating in informa- 
tion theory which show some valuable 
properties of uncertainty analysis. 
These equations are presented in a 
form which emphasizes the fact that 
an uncertainty analysis can be carried 
out so that all variables involved can 
be dealt with in a completely sym- 
metric manner. It is possible, for 
example, to talk about a single num- 
ber which represents the total contin- 
gent uncertainty in a multidimen- 
sional matrix, and then to demonstrate 
how parts or all of this contingent un- 
certainty can be made available for 
prediction of just one of the variables. 
From such equations it is possible to 
demonstrate certain relations which 
must hold between prediction of differ- 
ent variables in a single matrix. 

These relations have important 
implications for several areas of psy- 
chology, either in clarifying the mean- 
ing of terms used in statistical analy- 
sis or in showing how behavior ought 
to occur if it is to maximize the use of 
information. As one illustration, it is 
shown that information transmission 
in the multidimensional case cannot 
be as good when stimuli are predicted 
as when responses are predicted. 
Another illustration shows that the 
usual measure of redundancy of 
printed English can be interpreted 
accurately only when prediction is 


196 


made unilaterally, but that maximum 
predictability of single letters occurs 
when prediction (on the basis of 
redundancy) is allowed to operate in 
both the forward and backward di- 
rections. A third illustration shows 
that when human operators have to 
respond to a continuing series which 
is nonrandom, optimum performance 
(with an error criterion) will occur 
only when the operator lags behind 
the series to which he is responding; 
furthermore, the optimum lag will be 
a function of the memory of the oper- 
ator, the amount of noise in the situa- 
tion, and the distribution of redun- 
dancy throughout the series. 

These examples are in no sense in- 
clusive, and the implications of equa- 
tions presented here range into many 
areas of psychology. These illustra- 
tions are sufficient, however, to point 
out several possible directions for re- 
search, and to clarify some concepts 
now in use in psychology. 


W. R. Garner 


REFERENCES 


1. Garner, W. R. An informational analy- 
sis of absolute judgments of loudness. 
J. exp. Psychol., 1953, 46, 373-380. 
2. Garner, W. R., & Hake, H. W. The 
amount of information in absolute 
judgments. Psychol. Rev., 1951, 58, 
446-459. 
3. GARNER, W. R., & McGILL, W. J. The 
relation between information and vari- 
ance analyses. Psychometrika, 1956, 
21, 219-228. 
4. McGill, W. J. Multivariate information 
transmission. Psychometrika, 1954, 
19, 97-116. 
5. McGILL, W. J., & QUAsTLER, H. Stand- 
ardized nomenclature. In H. Quast- 
ler (Ed.), Information theory in psy- 
chology. Glencoe, Ill.: Free Press, 
1955. Pp. 83-92. 

. MILLER, G. A., & FRIEDMAN, E. A. The 
reconstruction of mutilated English 
texts. Information and Control, 1957 
1, 38-55. k 

7. Newman, E. B., & GERSTMAN, L. T.A 

ae method for analyzing printed 
nglish. J. exp. Ps 
114-125, ARENA 182, dä, 


8. SHANON c E. A mathematical theory 
œ communication, Bell Syst, 
J., 1948, 27, 379-423, 623-686, “°° 


(Received February 27, 1958) 


Psychological Review 
Vol. 65, No. 4, 1958 


HEREDITY, ENVIRONMENT, AND THE QUESTION “HOW?”* 


ANNE ANASTASI 


Fordham University 


Two or three decades ago, the so- 
called heredity-environment question 
was the center of lively controversy. 
Today, on the other hand, many psy- 
chologists look upon it as a dead issue. 
It is now generally conceded that both 
hereditary and environmental factors en- 
ter into all behavior. The reacting or- 
ganism is a product of its genes and its 
past environment, while present envi- 
ronment provides the immediate stimu- 
lus for current behavior. To be sure, it 
can be argued that, although a given 
trait may result from the combined in- 
fluence of hereditary and environmental 
factors, a specific difference in this trait 
between individuals or between groups 
may be traceable to either hereditary or 
environmental factors alone. The de- 
sign of most traditional investigations 
undertaken to identify such factors, 
however, has been such as to yield in- 
conclusive answers. The same set of 
data has frequently led to opposite con- 
clusions in the hands of psychologists 
with different orientations. 

Nor have efforts to determine the 
proportional contribution of hereditary 


and environmental factors to observed 
individual differences in given traits met 
Apart from 


with any greater success. 
difficulties in controlling conditions, such 
investigations have usually been based 
upon the implicit assumption that he- 
reditary and environmental factors com- 
dditive fashion. Both ge- 
ychologists have repeat- 
ed, however, that a more 

othesis is that of interaction 
taani S 40). In other words, the 


(15, 22, 2 

i ivision of Gen- 
1 Address of the President, Division o 
a SSOD, merican Psychological As- 
sociation, September 4, 1957. 


nature and extent of the influence of 
each type of factor depend upon the 
contribution of the other. Thus the 
proportional contribution of heredity to 
the variance of a given trait, rather 
than being a constant, will vary un- 
der different environmental conditions. 
Similarly, under different hereditary 
conditions, the relative contribution of 
environment will differ. Studies de- 
signed to estimate the proportional con- 
tribution of heredity and environment, 
however, have rarely included measures 
of such interaction. The only possible 


conclusion from such research would 


thus seem to be that both heredity and 
environment contribute to all behavior 
traits and that the extent of their re- 
spective contributions cannot be speci- 
fied for any trait. Small wonder that 
some psychologists regard the heredity- 
environment question as unworthy of 
further consideration! 

But is this really all we can find out 
about the operation of heredity and en- 
vironment in the etiology of behavior? 
Perhaps we have simply been asking the 
wrong questions. The traditional ques- 
tions about heredity and environment 
may be intrinsically unanswerable. Psy- 
chologists began by asking which type 
of factor, hereditary or environmental, 
is responsible for individual differences 
in a given trait. Later, they tried to 
discover “ow much of the variance was 
attributable to heredity and how much 
to environment. It is the primary con- 
tention of this paper that a more fruit- 
ful approach is to be found in the ques- 
tion “How?” There is still much to be 
Jearned about the specific modus oper- 
andi of hereditary and environmental 
factors in the development of behavioral 
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differences. And there are several cur- 
rent lines of research which offer promis- 
ing techniques for answering the ques- 
tion “How?” 


VARIETY OF INTERACTION MECHANISMS 


Hereditary factors. If we examine 
some of the specific ways in which he- 
reditary factors may influence behavior, 
we cannot fail but be impressed by their 
wide diversity. At one extreme, we find 
such conditions as phenylpyruvic amen- 
tia and amaurotic idiocy. In these 
cases, certain essential physical pre- 
requisites for normal intellectual de- 
velopment are lacking as a result of 
hereditary metabolic disorders. In our 
present state of knowledge, there is no 
environmental factor which can com- 
pletely counteract this hereditary deficit. 
The individual will be mentally defec- 
tive, regardless of the type of environ- 
mental conditions under which he is 
reared. 

A somewhat different situation is 
illustrated by hereditary deafness, which 
may lead to intellectual retardation 
through interference with normal social 
interaction, language development, and 
schooling. In such a case, however, the 
hereditary handicap can be offset by 
appropriate adaptations of training pro- 
cedures. It has been said, in fact, that 
the degree of intellectual backwardness 
of the deaf is an index of the state of 
development of special instructional fa- 
cilities. As the latter improve, the in- 
tellectual retardation associated with 
deafness is correspondingly reduced. 

A third example is provided by in- 
herited susceptibility to certain physi- 
cal diseases, with consequent protracted 
ill health. If environmental conditions 
are such that illness does in fact de- 
velop, a number of different behavioral 

effects may follow. Intellectually, the 
individual may be handicapped by his 
inability to attend school regularly. On 
the other hand, depending upon age of 


onset, home conditions, parental status, 
and similar factors, poor health may 
have the effect of concentrating the in- 
dividual’s energies upon intellectual pur- 
suits. The curtailment of Participation 
in athletics and social functions may 
serve to strengthen interest in reading 
and other sedentary activities. Con- 
comitant circumstances would also de- 
termine the influence of such illness 
upon personality development. And it 
is well known that the latter effects 
could run the gamut from a deepen- 
ing of human sympathy to psychiatric 
breakdown. 

Finally, heredity may influence þe- 
havior through the mechanism of social 


ues for identify- 
These cues thus 
trictions or op- 
more subtle level 
and expectancies, 
self concept tends 
such expectancies, 


ecome important 
of behavior ( 12, 
e association þe- 


behavior would 
hat culture, 


miners 
Obviously thi 


ic examples are not hard 
to find The most familiar instances 
occur in connection with constitutional 
types, sex, and race. Sex and skin pig- 


? 
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mentation obviously depend upon he- 
redity. General body build is strongly 
influenced by hereditary components, 
although also susceptible to environ- 
mental modification. That all these 
physical characteristics may exert a 
pronounced effect upon behavior within 
a given culture is well known. It is 
equally apparent, of course, that in dif- 
ferent cultures the behavioral correlates 
of such hereditary physical traits may 
be quite unlike. A specific physical cue 
may be completely unrelated to indi- 
vidual differences in psychological traits 
in one culture, while closely correlated 
with them in another. Or it may be 
associated with totally dissimilar behav- 
ior characteristics in two different cul- 
tures. 

It might be objected that some of the 
illustrations which have been cited do 
not properly exemplify the operation of 
hereditary mechanisms in behavior de- 
velopment, since hereditary factors en- 
ter only indirectly into the behavior in 
question. Closer examination, however, 
shows this distinction to be untenable. 
First it may be noted that the influence 
of heredity upon behavior is always in- 
direct. No psychological trait is ever 
inherited as such. All we can ever say 
directly from behavioral observations is 
that a given trait shows evidence of 
being influenced by certain “inheritable 
unknowns.” This merely defines a prob- 
Jem for genetic research; it does not 


provide a causal explanation. Unlike 


the blood groups, which are close to the 


level of primary gene products, psy- 
chological traits are related to genes 
by highly indirect and devious routes. 
Even the mental deficiency associated 
with phenylketonuria is several steps 
removed from the chemically defective 
genes that represent its hereditary ba- 
sis. Moreover, hereditary influences 
cannot be dichotomized into the more 
direct and the less direct. Rather a 
they represent 2 whole “continuum © 


indirectness,” along which are found all 
degrees of remoteness of causal links. 
The examples already cited illustrate a 
few of the points on this continuum. 

It should be noted that as we proceed 
along the continuum of indirectness, the 
range of variation of possible outcomes 
of hereditary factors expands rapidly. 
At each step in the causal chain, there 
is fresh opportunity for interaction with 
other hereditary factors as well as with 
environmental factors. And since each 
interaction in turn determines the di- 
rection of subsequent interactions, there 
is an ever-widening network of possible 
outcomes. If we visualize a simple se- 
quential grid with only two alternatives 
at each point, it is obvious that there 
are two possible outcomes in the one- 
stage situation, four outcomes at the 
second stage, eight at the third, and 
so on in geometric progression. The 
actual situation is undoubtedly much 
more complex, since there will usually 
be more than two alternatives at any 
one point. 

In the case of the blood groups, 
the relation to specific genes is so close 
that no other concomitant hereditary or 
environmental conditions can alter the 
outcome. If the organism survives at 
all, it will have the blood group deter- 
mined by its genes. Among psycho- 
logical traits, on the other hand, some 
variation in outcome is always possible 
as a result of concurrent circumstances. 
Even in cases of phenylketonuria, intel- 
lectual development will exhibit some 
relationship with the type of care and 
training available to the individual. 
That behavioral outcomes show pro- 
gressive diversification as we proceed 
along the continuum of indirectness is 
brought out by the other examples 
which were cited. Chronic illness can 
Jead to scholarly renown or to intel- 
lectual immaturity; a mesomorphic 
physique can be a contributing factor 
in juvenile delinquency or in the at- 
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tainment of a college presidency! Pub- 
lished data on Sheldon somatotypes 
provide some support for both of the 
latter outcomes. 

Parenthetically, it may be noted that 
geneticists have sometimes used the 
term “norm of reaction” to designate 
the range of variation of possible out- 
comes of gene ,properties (cf. 13, p. 
161). Thus heredity sets the “norm” 
or limits within which environmental 
differences determine the eventual out- 
come. In the case of some traits, such 
as blood groups or eye color, this norm 
is much narrower than in the case of 
other traits. Owing to the rather differ- 
ent psychological connotations of both 
the words “norm” and “reaction,” how- 
ever, it seems less confusing to speak of 
the “range of variation” in this context. 

A large portion of the continuum of 
hereditary influences which we have de- 
scribed coincides with the domain of 
somatopsychological relations, as defined 
by Barker et al. (6). Under this 
heading, Barker includes “variations in 
physique that affect the psychological 
situation of a person by influencing the 
effectiveness of his body as a tool for 
actions or by serving as a stimulus to 
himself or others” (6, p. 1). Rela- 
tively direct neurological influences on 
behavior, which have been the tradi- 
tional concern of physiological psychol- 
ogy, are excluded from this definition, 
Barker being primarily concerned with 
what he calls the “social Psychology of 
physique.” Of the examples cited in 
the present paper, deafness, severe ill- 
ness, and the physical characteristics 

associated with social stereotypes would 
meet the specifications of somatopsycho- 
logical factors. 

The somatic factors to which Barker 
refers, however, are not limited to those 
of hereditary origin. Bodily conditions 
attributable to environmental causes op- 
erate in the same sorts of somatopsy- 
chological relations as those traceable 


to heredity. In fact, heredity-environ- 
ment distinctions play a minor Part in 
Barker’s approach. 

Environmental factors: organic. Turn- 
ing now to an analysis of the role of 
environmental factors in behavior, we 
find the same etiological mechanisms 
which were observed in the case of 
hereditary factors. First, however, we 
must differentiate between two classes 
of environmental influences: (a) those 
producing organic effects which may in 
turn influence behavior and (b) those 
serving as direct stimuli for psychologi- 
cal reactions. The former may be illus- 
trated by food intake or by exposure to 
bacterial infection; the latter, by tribal 
initiation ceremonies or by a course in 
algebra. There are no completely satis- 
factory names by which to designate 
these two classes of influences. In an 
earlier paper by Anastasi and Foley (4) 
the terms “structural” and “functional” 
were employed. However, “organic” 
and “behavioral” have the advantage of 
greater familiarity in this context and 
may be less open to misinterpretation. 


Accordingly, these terms will be used in 
the present Paper, 


Like hereditary factor. i 
: S, eny; 
influences of an Sta re 


Organic nature can also 
be a a along a continuum of indi- 
rectness with regard to their relati 
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corresponds closely to that of hereditary 
deafness cited earlier in the paper. 

Finally, we may consider an environ- 
mental parallel to the previously dis- 
cussed social stereotypes which were 
mediated by hereditary physical cues. 
Let us suppose that a young woman 
with mousy brown hair becomes trans- 
formed into a dazzling golden blonde 
through environmental techniques cur- 
rently available in our culture. It is 
highly probable that this metamorpho- 
sis will alter, not only the reactions of 
her associates toward her, but also her 
own self concept and subsequent be- 
havior. The effects could range all the 
way from a rise in social poise to a drop 
in clerical accuracy! 

Among the examples of environmen- 
tally determined organic influences 
which have been described, all but the 
first two fit Barker’s definition of soma- 
topsychological factors. With the ex- 
ception of birth injuries and nutritional 
deficiencies, all fall within the social 
psychology of physique. Nevertheless, 
the individual factors exhibit wide di- 
versity in their specific modus operandi 
—a diversity which has important prac- 
tical as well as theoretical implications. 

Environmental factors: behavioral. 
The second major class of environmen- 
tal factors—the behavioral as contrasted 
to the organic—are by definition direct 
influences. The immediate effect of 
such environmental factors is always 4 
behavioral change., To be sure, some 
of the initial pehilvioral effects may 
themselves indirectly affect the indi- 
vidual’s later behavior. But this rela- 
tionship can perhaps be best concep- 
tualized in terms of breadth and pêr- 
manence of effects. Thus it could be 
said that we are now dealing, not with a 
continuum of indirectness, as in the Eo 
of hereditary and organic-environment2- 
factors, but rather with a continuum 0; 


breadth. 


Social class membership may serve 


as an illustration of a relatively broad, 
pervasive, and enduring environmental 
factor. Its influence upon behavior de- 
velopment may operate through many 
channels. Thus social level may deter- 
mine the range and nature of intellec- 
tual stimulation provided by home and 
community through books, music, art, 
play activities, and the like. Even more 
far-reaching may be the effects upon in- 
terests and motivation, as illustrated by 
the desire to perform abstract intellec- 
tual tasks, to surpass others in competi- 
tive situations, to succeed in school, or 
to gain social approval. Emotional and 
social traits may likewise be influenced 
by the nature of interpersonal relations 
characterizing homes at different socio- 
economic levels. Somewhat more re- 
stricted in scope than social class, al- 
though still exerting a relatively broad 
influence, is amount of formal schooling 
which the individual is able to obtain. 

A factor which may be wide or narrow 
in its effects, depending upon concomit- 
ant circumstances, is language handi- 
cap. Thus the bilingualism of an adult 
who moves to a foreign country with 
inadequate mastery of the new language 
represents a relatively limited handicap 
which can be readily overcome in most 
cases. At most, the difficulty is one of 
communication. On the other hand, 
some kinds of bilingualism in childhood 
may exert a retarding influence upon 
intellectual development and may un- 
der certain conditions affect personality 
development adversely (2; 5) 10) 0A 
common pattern in the homes of immi- 
grants is that the child speaks one lan- 

age at home and another in school, 
so that his knowledge of each language 
is limited to certain types of situations. 
Inadequate facility with the language of 
the school interferes with the acquisition 
of basic concepts, intellectual skills, and 
information. The frustration engendered 
by scholastic difficulties may in turn 
lead to discouragement and general dis- 
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like of school. Such reactions can be 
found, for example, among a number of 
Puerto Rican children in New York City 
schools (3). In the case of certain 
groups, moreover, the child’s foreign 
language background may be perceived 
by himself and his associates as a sym- 
bol of minority group status and may 
thereby augment any emotional malad- 
justment arising from such status (34). 

A highly restricted environmental in- 

fluence is to be found in the oppor- 
tunity to acquire specific items of in- 
formation occurring in a particular in- 
telligence test. The fact that such 
opportunities may vary with culture, 
social class, or individual experiential 
background is at the basis of the test 
user’s concern with the problem of 
coaching and with “culture-free” or 
“culture-fair” tests (cf. 1, 2). If the 
advantage or disadvantage which such 
experiential differences confer upon cer- 
tain individuals is strictly confined to 
performance on the given test, it will 
obviously reduce the validity of the test 
and should be eliminated. 

In this connection, however, it is es- 
sential to know the breadth of the en- 
vironmental influence in question. A 
fallacy inherent in many attempts to 
develop culture-fair tests is that the 
breadth of cultural differentials is not 
taken into account. Failure to consider 
breadth of effect likewise characterizes 
certain discussions of Coaching. If, in 
coaching a student for a College admis- 
sion test, we can improve his knowledge 
of verbal concepts and his reading com- 
prehension, he will be better equipped 
to succeed in college courses, His per- 
formance level will thus be raised, not 
only on the test, but also on the cri- 
terion which the test is intended to pre- 
dict. To try to devise a test which is 
not susceptible to such Coaching would 
merely reduce the effectiveness of the 
test. Similarly, efforts to rule out cul- 
tural differentials from test items so as 


to make them equally “fair” to subjects 
in different social classes or in different 
cultures may merely limit the useful- 
ness of the test, since the same cultural 
differentials may operate within the 


broader area of behavior which the test 
is designed to sample. 
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(36) on rats indicated that “maze learn- 
ing ability” was inherited, we were still 
a long way from knowing what was 
actually being transmitted by the genes. 
It was obviously not “maze learning 
ability” as such. Twenty—or even ten 
—years ago, some psychologists would 
have suggested that it was probably 
general intelligence. And a few might 
even have drawn a parallel with the in- 
heritance of human intelligence. 

But today investigators have been 
asking: Just what makes one group of 
rats learn mazes more quickly than the 
other? Is it differences in motivation, 
emotionality, speed of running, general 
activity level? If so, are these behav- 
ioral characteristics in turn dependent 
upon group differences in glandular de- 
velopment, body weight, brain size, bio- 
chemical factors, or some other organic 
conditions? A number of recent and 
ongoing investigations indicate that at- 
tempts are being made to trace, at least 
part of the way, the steps whereby cer- 
tain chemical properties of the genes 
may ultimately lead to specified behav- 
jor characteristics. 

An example of such a study is pro- 
vided by Searle’s (31) follow-up of 
Tryon’s research. Working with the 
strains of maze-bright and maze-dull 
rats developed by Tryon, Searle demon- 
strated that the two strains differed in 
a number of emotional and motivational 
factors, rather than in ability. Thus 
the strain differences were traced one 
step further, although many links still 
remain to be found between maze learn- 
ing and genes. A promising methodo- 
logical development within the same 
ea is to be found in the recent 
h of Hirsch and Tryon (18). 
Utilizing a specially devised technique 
for measuring individual differences 1? 
behavior among lower organisms, these 
investigators Jaunched a series of stud- 
jes on selective preeding for behavioral 
characteristics in the fruit fly, Dro- 


researc 


sophila. Such research can capitalize 
on the mass of available genetic knowl- 
edge regarding the morphology of Dro- 
sophila, as well as on other advantages 
of using such an organism in genetic 
studies. 

Further evidence of current interest 
in the specific hereditary factors which 
influence behavior is to be found in an 
extensive research program in progress 
at the Jackson Memorial Laboratory, 
under the direction of Scott and Fuller 
(30). In general, the project is con- 
cerned with the behavioral character- 
istics of various breeds and cross-breeds 
of dogs. Analyses of some of the data 
gathered to date again suggest that 
“differences in performance are pro- 
duced by differences in emotional, mo- 
tivational, and peripheral processes, and 
that genetically caused differences in 
central processes may be either slight 
or non-existent” (29, p. 225). In other 
parts of the same project, breed dif- 
ferences in physiological characteristics, 
which may in turn be related to be- 
havioral differences, have been estab- 
lished. 

A second line of attack is the explo- 
ration of possible relationships between 
behavioral characteristics and physio- 
logical variables which may in turn be 
traceable to hereditary factors. Re- 
search on EEG, autonomic balance, 
metabolic processes, and biochemical 
factors illustrates this approach. A 
lucid demonstration of the process of 
tracing & psychological condition to ge- 
netic factors is provided by the identifi- 
cation and subsequent investigation of 
phenylpyruvic amentia. In this case, 
the causal chain from defective gene, 
through metabolic disorder and conse- 
quent cerebral malfunctioning, to feeble- 
mindedness and other overt symptoms 
can be described step by step (cf. 32; 
33, pp. 389-391). Also relevant are 
the recent researches on neurological 
and biochemical correlates of schizo- 
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phrenia (9). Owing to inadequate 
methodological controls, however, most 
of the findings of the latter studies must 
be regarded as tentative (19). 

Prenatal environmental factors pro- 
vide a third avenue of fruitful investi- 
gation. Especially noteworthy is the 
recent work of Pasamanick and his as- 
sociates (27), which demonstrated a 
tie-up between socioeconomic level, com- 
plications of pregnancy and parturition, 
and psychological disorders of the off- 
spring. In a series of studies on large 
samples of whites and Negroes in Balti- 
More, these investigators showed that 
various prenatal and paranatal disor- 
ders are significantly related to the oc- 
currence of mental defect and psychi- 
atric disorders in the child. An impor- 
tant source of such irregularities in the 
process of childbearing and birth is to 
be found in deficiencies of maternal diet 
and in other conditions associated with 
low socioeconomic status. An analysis 
of the data did in fact reveal a much 
higher frequency of all such medical 
complications in lower than in higher 
socioeconomic levels, and a higher fre- 
quency among Negroes than among 
whites. 

Direct evidence of the influence of 
prenatal nutritional factors upon subse- 
quent intellectual development is to be 
found in a recent, well controlled ex- 
periment by Harrell et al. (16). The 
subjects were pregnant women in low- 
income groups, whose normal diets were 
generally quite deficient. A dietary sup- 
plement was administered to some of 
these women during pregnancy and lac- 
tation, while an equated control group 
received placebos. When tested at the 
ages of three and four years, the off- 
spring of the experimental group ob- 
tained a significantly higher mean IQ 
than did the offspring of the controls. 

Mention should also be made of ani- 
mal experiments on the effects of such 
factors as prenatal radiation and neo- 


natal asphyxia upon cerebral anomalies 
as well as upon subsequent behavior de- 
velopment. These experimental studies 
merge imperceptibly into the fourth ap- 
proach to be considered, namely, the in- 
vestigation of the influence of early ex- 
perience upon the eventual behavioral 
characteristics of animals. Research in 
this area has been accumulating at a 
rapid rate. In 1954, Beach and Jaynes 
(8) surveyed this literature for the Psy- 
chological Bulletin, listing over 130 ref- 
erences. Several new studies have ap- 
peared since that date (e.g., 14, 21, 24, 
25, 35). The variety of factors covered 
ranges from the type and quantity of 
available food to the extent of contact 
with human culture. A large number 
of experiments have been concerned 
with various forms of sensory depriva- 
tion and with diminished opportunities 
for motor exercise. Effects have been 
observed in many kinds of animals and 
in almost all aspects of behavior, includ- 


ing perceptual responses, motor activity 
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performance in learning situations. All 
this research represents a rapidly grow- 
ing and promising attack on the modus 
operandi of specific environmental fac- 
tors. 

The human counterpart of these ani- 
mal studies may be found in the com- 
parative investigation of child-rearing 
practices in different cultures and sub- 
cultures. This represents the fifth ap- 
proach in our list. An outstanding ex- 
ample of such a study is that by Whit- 
ing and Child (38), published in 1953. 
Utilizing data on 75 primitive societies 
from the Cross-Cultural Files of the 
Yale Institute of Human Relations, 
these investigators set out to test a 
number of hypotheses regarding the re- 
lationships between child-rearing prac- 
tices and personality development. This 
analysis was followed up by field ob- 
servations in five cultures, the results of 
which have not yet been reported (cf. 
37). 
Within our own culture, similar sur- 
veys have been concerned with the di- 
verse psychological environments pro- 
vided by different social classes (11). 
Of particular interest are the study by 
Williams and Scott (39) on the as- 
sociation between socioeconomic level, 
permissiveness, and motor development 
among Negro children, and the explora- 
tory research by Milner (26) on the 
relationship between reading readiness 
in first-grade children and patterns of 
parent-child interaction. Milner found 
that upon school entrance the lower- 
class child seems to lack chiefly two ac- 
vantages enjoyed by the middle-class 
child. The first is described as “a warm 
positive family atmosphere or adult-re- 
lationship pattern which is more and 
more being recognized as a motivational 
prerequisite of any kind of adult-con- 
trolled learning.” The lower-class chil- 
dren in Milner’s study perceived adults 
as predominantly hostile. The second 


advantage is an extensive opportunity 


to interact verbally with adults in the 
family. The latter point is illustrated 
by parental attitudes toward mealtime 
conversation, lower-class parents tend- 
ing to inhibit and discourage such con- 
versation, while middle-class parents en- 
courage it. 

Most traditional studies on child-rear- 
ing practices have been designed in 
terms of a psychoanalytic orientation. 
There is need for more data pertaining 
to other types of hypotheses. Findings 
such as those of Milner on opportunities 
for verbalization and the resulting ef- 
fects upon reading readiness represent 
a step in this direction. Another pos- 
sible source of future data is the ap- 
plication of the intensive observational 
techniques of psychological ecology de- 
veloped by Barker and Wright (7) to 
widely diverse socioeconomic groups. 

A sixth major approach involves re- 
search on the previously cited soma- 
topsychological relationships (6). To 
date, little direct information is avail- 
able on the precise operation of this 
class of factors in psychological devel- 
opment. The multiplicity of ways in 
which physical traits—whether heredi- 
tary or environmental in origin—may 
influence behavior thus offers a rela- 
tively unexplored field for future study. 

The seventh and final approach to 
be considered represents an adaptation 
of traditional twin studies. From the 
standpoint of the question “Howe?” 
there is need for closer coordination be- 
tween the usual data on twin resem- 
plance and observations of the family 
interactions of twins. Available data 
already suggest, for example, that close- 
ness of contact and extent of environ- 
mental similarity are greater in the case 
of monozygotic than in the case of di- 
zygotic twins (cf. 2). Information on 
the social reactions of twins toward 
each other and the specialization of 
roles is likewise of interest (2). Espe- 
cially useful would be longitudinal stud- 
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ies of twins, beginning in early infancy 
and following the subjects through 
school age. The operation of differen- 
tial environmental pressures, the devel- 
opment of specialized roles, and other 
environmental influences could thus be 
more clearly identified and correlated 
with intellectual and personality changes 
in the growing twins. 

Parenthetically, I should like to add 
a remark about the traditional applica- 
tions of the twin method, in which per- 
sons in different degrees of hereditary 
and environmental relationships to each 
other are simply compared for behav- 
ioral similarity. In these studies, at- 
tention has been focused principally 
upon the amount of resemblance of 
monozygotic as contrasted to dizygotic 
twins. Yet such a comparison is par- 
ticularly difficult to interpret because of 
the many subtle differences in the en- 
vironmental situations of the two types 
of twins. A more fruitful comparison 
would seem to be that between dizygotic 
twins and siblings, for whom the he- 
reditary similarity is known to be the 
same. In Kallmann’s monumental re- 
search on psychiatric disorders among 
twins (20), for example, one of the most 
convincing bits of evidence for the op- 
eration of hereditary factors in schizo- 
phrenia is the fact that the degrees of 
concordance for dizygotic twins and for 
siblings were practically identical. In 
contrast, it will be recalled that in in- 
telligence test scores dizygotic twins re- 
semble each other much more closely 
than do siblings—a finding which re- 
veals the influence of environmental fac- 
tors in intellectual development. 


SUMMARY 


The heredity-environment problem is 
still very much alive. Its viability is 
assured by the gradual replacement of 
the questions, “Which one?” and “How 
much?” by the more basic and appro- 
priate question, “How?” Hereditary in- 
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fluences—as well as environmental fac- 
tors of an organic nature—vary along a 
“continuum of indirectness.” The more 
indirect their connection with behavior, 
the wider will be the range of variation 
of possible outcomes. One extreme of 
the continuum of indirectness may be 
illustrated by brain damage leading to 
mental deficiency; the other extreme, by 
physical characteristics associated with 
social stereotypes. Examples of factors 
falling at intermediate points include 
deafness, physical diseases, and motor 
disorders. Those environmental] factors 
which act directly upon behavior can be 
ordered along a continuum of breadth 
or permanence of effect, as exemplified 
by social class membership, amount of 
formal schooling, language handicap, 


and familiarity with Specific test items 
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reditary and environmental factors in- 
teract in behavior development. 
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HOW ARE INTERTRIAL “AVOIDANCE” RESPONSES 
REINFORCED? 


0. H. MOWRER AND J. D. KEEHN 1 
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Beginning with a rather incidental 
observation reported by May (14) in 
1948, a number of investigators (6, 22, 
25) have shown that in a conditioning 
set-up, involving a noxious uncondi- 
tioned stimulus, the incidence of inter- 
trial (spontaneous) responses will in- 
crease markedly if such responses are 
allowed to function instrumentally, i.e., 
to “avoid,” in the sense of postponing, 
the next “trial.” This effect is not ob- 
tainable if the unconditioned stimulus is 
preceded by a warning signal (27), but 
does occur regardless of whether the 
instrumental intertrial response is the 
same as or different from the response 
produced by the unconditioned stimulus 
(13; cf. 18).- The increased incidence 


of intertrial responses under the condi- 
tions described is in contrast to the 
previously reported observation that in- 
tertrial responses occur progressively 
less frequently in a conditioning situa- 
tion wherein they cannot function in- 
strumentally, i.e., where the intertrial 
interval is fixed or at least variable only 
by the experimenter (7, 19). 

The discovery that an instrumental 
procedure of the kind described will 
increase the incidence of intertrial re- 
sponses followed logically enough from 
the finding of Hunter (11), Brogden, 
Lipman, and Culler (3); and others that 
a response made to a warning (condi- 
ulus is more readily and 
fixated if, when such a re- 
nconditioned stimu- 


lus is om it 
willynilly (as in 
f absence; 1956-57, by benefit 


from the Department 
f Beirut. 
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ing) with the conditioned stimulus. And 
from this and related work it was soon 
evident that so-called avoidance learn- 
ing is not a matter of simple (stimulus- 
substitution) conditioning, but that it 
involves two distinct stages or “levels”: 
when a signal is paired with some pain- 
ful stimulus, what an organism learns 
first is to be afraid; and then it learns, 
perhaps by a quite different process, 
what to do about its fear (16, 17). Yet, 
with respect to the learning of instru- 
mental (avoidance) intertrial responses, 
there has been a tendency to adhere to 
the older one-stage stimulus-substitu- 
tion type of interpretation. The pur- 
pose of the present paper is to show, on 
both empirical and logical grounds, how 
much more satisfactory is a two-factor 
type of explanation. 

Previous RESEARCH AND ‘THEORY 
1946, Hart West- 
brook made what seems to have been 
the first attempt to set up an experi- 
mental situation in which intertrial re- 
sponses might function instrumentally 
in the manner already indicated. How- 
ever, since negative results were ob- 
tained (probably because training was 
not continued long enough; see later 
section on Interval-Avoidance Learn- 
ing), the details of this study need not 
be described here. Then, as already 
indicated, two years later May reported 
the occurrence of intertrial response 
Jearning in a situation designed to in- 
vestigate quite a different problem. This 
decidedly incidental, but striking, find- 
ing was reported thus: 


In the above experiments the swinging door 
over the barrier [separating the two sides of 


In the winter of 
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a shuttle box] was never locked. The rats 
could cross and recross at will. An indefinite 
number of crossings was possible during the 
training and testing trials. Sixty to 90 sec. 
were allowed between the last spontaneous 
crossing and the next training or testing trial. 
Thus, on the training trials an animal could 
avoid shock for a considerable time by cross- 
ing and recrossing at intervals of less than 60 
sec. Ten of the 12 experimental animals 
(Group A—door closed) and 11 of the con- 
trols learned this form of adaptive behavior 
and used it fairly regularly both on the train- 
ing and on the test trials. Very few of the 
animals in B-Group (door open), on the other 
hand, learned it (14, p. 71). 


In the summary of the study cited, 
May does not allude to this finding and 
elsewhere in his paper makes no attempt 
to interpret or analyze it. However, in 
a study by Bugelski and Coyer (6), 
briefly reported in 1950, the problem 
was attacked quite directly and with a 
theoretical issue in mind, implied by the 
title, “Temporal Conditioning vs. Anx- 
iety Reduction in Avoidance Learning.” 
A few years later, Bugelski described 
the procedure employed thus: 


Every 15 seconds a shock was applied in suf- 
ficient strength to force a rat over a hurdle. 
With repeated trials, the animals came to give 
almost perfect performance, jumping the hurdle 
in the general interval of 10 to 15 seconds, 
thus avoiding shock (5, p. 68). 


In accompanying graphs Bugelski 
shows that, although rats learned to 
“shuttle” more readily under the condi- 
tions indicated (ie, when the shock- 
shock interval was 15 seconds), another 
group of animals also solved the prob- 
lem with a shock-shock interval of 60 


seconds. Say Bugelski and Coyer of 
their study: 


The results support Hull’s account of avoid- 
ance behavior (Principles) as antedating re- 
sponses developing from escape reactions. 
Anxiety reduction and expectancy explana- 
tions need not be invokeq for the responses 
studied (6, p. 265). 


By this, these writers apparently mean 
that shuttling, when it occurs between 
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trials, is just the conditioned (antedat- 
ing) form of the escape reaction elicited 
by the shock which now, however, is 
produced by the experimental situation 
alone (without shock). Such an inter- 
pretation is, in a sense, classically Pav- 
lovian and, as these writers point out, 
does not require that fear be posited as 
an intervening variable or that fear re- 
duction be a form of reinforcement. 
Reasons for doubting the adequacy of 
this view will be given presently. 

Then, in 1953, Sidman, using a dif- 
ferent type of response and training 
procedure, also reported positive results: 

White rats were the experimental organisms, 
with lever pressing selected as the avoidance 
response. Shocks of a fixed 0.2-sec. duration 
were given to the animal through a grid floor 
at regular [commonly 5-sec.] intervals unless 
the lever was depressed. Each lever depres- 


sion reset the timer controlling the shock, thus 
delaying its appearance [by 20 


volved, approximately 50 ani: 
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avoidance rate is not simply some form of 
temporal conditioning in which the responses 
are triggered-off by the passage of a time in- 
terval (25, p. 158). 


Because of the brevity of the paper 
just cited, the precise nature of Sid- 
man’s theoretical position was not en- 
tirely clear, so we turn to the paper by 
Schoenfeld which Sidman cites. This 
paper is long and intricate and cannot 
be summarized here; but the following 
sentences, from the summary, are per- 
haps the most relevant ones for present 
purposes: 

Following several objections raised against 
[the] anxiety-reduction hypothesis, an alterna- 
tive explanation of avoidance is offered in 
which the response is conceived to be always 
a stimulus-terminating one. . . . It is argued 
that the avoidance response terminates [aver- 
sive] stimulus compounds in which propriocep- 
tive and tactile stimuli are important com- 
ponents. . . . If this formulation is correct, 
avoidance conditioning proves to be a form of 
escape training, and its avoidance function is 
incidental to its stimulus-termination function 
(23, p. 97). 


That the Schoenfeld position is, how- 
ever, open to varying interpretations is 
indicated by the following comment by 
Kamin: 


The usual assumption has been that spon- 
taneous responses result from conditioning to 
generalized apparatus cues (cf. 9, p. 76; 23, 
p: 89) (12, ps. 71)? 


While the theoretical picture was thus 
somewhat confused as of 1954, it is fair 
to say that the prevailing tendency was 
to explain the reinforcement of instru- 


2 Solomon and Brush (33, p. 266) allude to 
a study of intertrial avoidance learning Te- 
ported (as a personal communication) by 
Milner in 1955. In only one respect (which 
will be mentioned later) does this study €x- 
tend what had been learned from earlier ex- 
Solomon and Brush also give a 
comprehensive review of research on avoid- 
ance learning where the CS-US interval is 
systematically varied (with different subjects). 
However, this line of work, while germane, 15 
not directly relevant to the present discussion 
and will not be specifically cited. 


periments. 
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mental intertrial responses as noć involv- 
ing the reduction of a temporally condi- 
tioned fear (or anxiety), and perhaps 
not involving the reduction of fear at all. 


EXPERIMENTAL DISPROOF OF CLASSICAL 
CONDITIONING INTERPRETATION 


The fear-reduction hypothesis has 
been so successful in accounting for 
avoidance learning in general, and in 
resolving paradoxes generated by other 
theories, that there is strong presump- 
tion that it can also explain the learn- 
ing of instrumental intertrial responses. 
But at once there is at least a super- 
ficial difficulty. In “ordinary” avoid- 
ance learning, i.e., where there is a defi- 
nite and specific warning signal, it is 
simple and reasonable to suppose (a) 
that fear becomes conditioned to this 
signal and (b) that any response which 
turns the signal off (and averts the un- 
conditioned stimulus) will be reinforced 
by fear reduction. But, in the case of 
instrumental intertrial responses, there 
is no such specific warning signal, so 
that the fear-reduction hypothesis, if ap- 
plicable at all, must apply in some rather 
subtle and special way. In later sec- 
tions, this inference will be considered 
in detail; but first a simple experiment 
will be reported which clearly shows the 
inadequacy of the classical conditioning 
type of explanation. 

Kamin has already been quoted as 
saying: “The usual assumption has been 
that spontaneous responses result from 
conditioning to generalized apparatus 
cues.” If the classical conception of 
conditioning were valid, then, every time 
a subject experienced -electric shock in 
a given experimental situation and made 
a specific response thereto, there would 
be an increased tendency for the sub- 
ject to make that response to the stimu- 
lus compound which immediately pre- 
cedes the shock. Since it is the experi- 
mental situation as a whole, rather than 
a more specific stimulus, which here 
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precedes shock and thus becomes, in 
effect, the conditioned stimulus, the sub- 
ject should soon start making the re- 
sponse in question to that situation 
“spontaneously,” i.e., without a specific 
cue or signal. 

On the other hand, if one accepts the 
two-factor version of avoidance learn- 
ing, one must assume that, in order for 
intertrial responses to become more fre- 
quent, they must be reinforced by the 
Occurrence of fear reduction, 
one case, mere contiguity of stimulation 
(situation + shock) would Provide the 
conditions necessary for the increased 
Occurrence of Tesponses; 
whereas, in the other case, something 


responses 
would have to be in some sense func- 


tional, instrumental, problem solving. 
In the experiment to be described 
shortly, these two hypotheses are put to 
the test; but, before describing that ex- 
periment, it will be necessary to say a 
word about apparatus and procedure, 
In the instrumental intertrial learn- 

ing experiments already alluded to, 
Westbrook and Schaefer used a wheel- 
turning response, May and Bugelski and 
Coyer used shuttling, and Sidman used 
bar pressing. In casting about for an 
avoidance response that might be more 
“natural” for rats than either bar press- 
ing or wheel turning and also free of the 
conflict that necessarily exists (at least 
in the beginning) in a shuttling set-up, 
one of the present writers (J. D. K.) 
tried out a revolving cage similar to that 
employed in the experiment of Brogden, 
Lipman, and Culler (3) and found it 
highly satisfactory. With the response 
now merely a short run (which revolves 
the cage but otherwise gets the subject 
nowhere), rats show excellent interyal- 
avoidance learning; and with a few re- 
finements the apparatus has proved it- 
self to be reliable, flexible, and well 
adapted to the investigation of a wide 
range of related problems. The refine- 


ments are these: (a) a simple, silent 
“lock” which allows the cage to move 
easily in one direction but not in the 
other; (b) a device which quietly resets 


a minimum ‘of two or three inches; and 


made automatic and put beyond the 


es of experimenter Produced vari- 
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sions of habii 
jected to dai 


of one-hour durati ice 
following condities Ti fa ak 
shock from the grillwork wh h Saat 
tutes the “floor” of the revolvable wt 
came On every 2 Seconds : a 
on until the rat Tan a ref 


cage far €nough 


m 3 
ceived 37 shog s (indicat rea) re- 
teral spurs on the cumulatie” be la- 
line). Since a ț sve reg 


the minute for 60 lutas) Cocks 
been received during the 


‘How Are INTERTRIAL “AVOIDANCE” RESPONSES REINFORCED? 213 


CAGE REVOLUTIONS 


EXPERIMENTAL SESSIONS 


Fic. 1. Cumulative records of an albino rat's performance in a revolvable cage when between- 
trial (“spontaneous”) runs postpone the next shock by 20 seconds. An activity baseline, repre- 
senting the amount of running by the subject during the third of three daily habituation sessions 
of one hour each, under conditions of no shock, is shown in Graph 3-H. Graphs 1-T, 2-T, and 
3-T represent the amount of running during hourly sessions on each of three successive “train- 
ing” days. Shocks were applied only when the subject failed, for 20 seconds, to revolve the 
cage by the prescribed amount (two or three inches). Shocks are indicated by the lateral spurs 
on the underside of the graphs. 
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session if no interval running had oc- 
curred, it is clear that 143 shocks were 
thus avoided. On the second day the 
number of shocks not avoided dropped 
to 15, and on the third day this number 
dropped still further to 11. Here, mani- 
festly, is highly successful avoidance 
learning, in a situation that is basically 
simple both as regards apparatus and 
procedure. 

But such results, taken alone, are 
somewhat ambiguous. There are several 
ways in which they might be interpreted. 
In the absence of any other evidence to 
the contrary, one might infer, with Bu- 
gelski and Coyer, that the between- 
shock runs are simply conditioned (an- 
tedating) versions of the reaction of the 
subject to the shock itself. Lacking a 
more specific conditioned stimulus, such 
as a light or buzzer to herald the ap- 
proach of shock, the total situation 
might here be thought of as the con- 
ditioned stimulus, which, being con- 
tinuously present between shock pres- 
entations, elicits numerous, somewhat 
irregular occurrences of the running 
response, 

Or, consider another possibility. 
There is an obvious resemblance þe- 
tween the curves shown in Fig. 1 and 


those commonly reported, by Skinner 
(31) and others, where the subject, mo- 
tivated by hunger, gets reinforced only 
intermittently (with food) for pushing 
a bar but continues to make this Tre- 


sponse at a fairly rapid and regular rate 
between reinforcements. Unlike the 
hunger, which is continuous, the shock 
in this situation is discontinuous; but 
fear is presumably present much of the 
time while the subject is in the revolva- 
ble cage. So one might think of the 
response to shock-plus-fear as generaliz- 
ing to fear alone and thus Occurring 
many times on the basis of the sheer 
habit strength (or reflex reserve) gen- 
erated solely by the reinforcement pro- 
vided by escape from the shock-plus- 


fear on the occasions when shock was 
presented. Here, as in the Bugelski- 
Coyer type of explanation, there would 
be no need to posit any reinforcement 
of the interval responses themselves, 
through fear reduction; they could be 
thought of instead as pure extinction 
responses, 

Fortunately, a simple variation of the 
Procedure which leads to results such as 
those shown in Fig. 1 provides a fairly 
crucial means of determining whether 
the interval responses which are here 
exhibited are reinforced only on those 
occasions when shock is received or are 
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CAGE REVOLUTIONS 


EXPERIMENTAL SESSIONS 


Fic. 2. Cumulative records of an albino rat’s performance in a revolvable cage when a run 
(of two or three inches) turns off the shock (received from the cage floor) but, when occurring 


spontaneously, does not postpone the next shock. The four graphs 
The shocks, equal in intensity and number, are distributed in ti 


are labeled as in Fig. 1. 
me just as were those received 
This type of procedure provides a control 


by the subject whose performance is shown in Fig. 1. 
for certain hypotheses, cited in the text, which have been advanced to account for the type 
of interval-response avoidance learning shown in Fig. 1. 


revolutions for Animal E. As Curve 
3-T shows, when the shock came on, 
Animal C would still run far enough to 
turn the shock off (or perhaps a little 
further), but there was virtually no in- 
terval running. This animal had learned 
that the shocks were not avoidable, that 
they came regardless of what was done 
in the intervals between shocks; s50, 
there being no gain from such behavior, 
the animal, sensibly enough, abandoned 
it. This outcome, substantiated by re- 
sults obtained with three other pairs of 
animals, supports the hypothesis that 
interval responses, if they are to be 
perpetuated, must be reinforced in some 
manner other than by the reinforcement 
provided by the mere pairing of situa- 
tion and shock. 


INTERVAL AVOIDANCE LEARNING FUR- 
THER ANALYZED AND INTERPRETED 


From the empirical results just re- 


ported, it is evident that intertrial avoid- 


ance learning cannot be adequately ex- 
plained in terms of classical condition- 
ing. Some more intricate interpretation 
is obviously called for; the one which 
most readily suggests itself involves 
the now well authenticated concept of 
stimulus trace (1, 4). In an experi- 
mental situation of the kind under dis- 
cussion, each response, whether forced 
(by shock) or spontaneous, presumably 
sets up a neural reverberation (“immedi- 
ate memory”) which decays to zero (or 
at least to asymptote) in roughly 30 
seconds. If, therefore, electric shock 
be delivered to the subject whenever, 
let us say, the 20-second point on the 
stimulus trace is reached, one would ex- 
pect that fear would become conditioned 
to this point on the trace and, in gen- 
eralizing forward, would motivate the 
subject to make the response in question 
during the intertrial (20-second) inter- 
val, thus averting the impending shock. 
And, since each new occurrence of the 
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response would, so to say, “reset” the 
trace, there would be a reduction in fear 
and hence a reinforcement of the re- 
sponse itself (cf. 7, 20). 

Reasonable as it is a priori, this hy- 
pothesis did not, at first, seem to have 
much empirical justification. The West- 
brook study was undertaken with this 
hypothesis explicitly in mind; and when 
preliminary work failed to reveal the ex- 
pected effect, i.e., showed no tendency 
for the intertrial response to become 
temporally conditioned, the study was 
discontinued and the hypothesis ques- 
tioned. May, as already noted, was ap- 
parently the first to obtain good inter- 
trial avoidance learning, but he made 
no attempt to account for it theoreti- 

cally. Bugelski and Coyer, to be sure, 

reported (6) that, with repeated trials, 
the animals came to give almost perfect 
performance, jumping the hurdle in the 
general interval of 10 to 15 seconds, 
thus avoiding shock; but these writers 
advanced the now untenable antedating 
hypothesis (of Pavlov and Hull). Sid- 
man, finding no evidence of temporal 
conditioning in his own early researches 
(see Fig. 3), adopted an explanation, 
derived from Schoenfeld, to the effect 
that the “correct” interval Tesponse is 
learned because other intertrial Tesponses 


3 Imagine an experimental situation in which 
a relatively loud tone came on immediately 
after the subject had responded in a specified 
way and then gradually softened—or a patch 
of light which, upon occurrence of the re- 
sponse, became suddenly large and then grad- 
ually diminished in size. Suppose also that 
the subject never received shock when the 
stimulus was large (or loud) but was shocked 
when it diminished beyond a certain point. It 
would be in no way surprising if the subject, 
under these conditions, learned to use such a 
stimulus as the basis of successful avoidance 
behavior. Our conjecture is that, in interval- 
avoidance learning of the kind under discus- 
sion, something comparable is involved, the 
principal difference being that the changing 
cue, instead of being an external stimulus, is 
a stimulus trace produced by the last preced- 
ing instrumental (overt) response. 


get punished (in the sense of failing to 
postpone shock) and therefore, relative 
to the “correct” response, become in- 
hibited. Moreover, since the occurrence 
of the correct response stops or pre- 
vents the occurrence of other (fear- 
producing) responses, this response will 
have a tendency to reduce fear and thus 
be reinforced—an interpretation which 
makes no use whatever of the stimulus- 
trace concept. (Sidman did not, to be 
sure, use precisely the above terminol- 
ogy; but, in light of considerations to 
be advanced in the next section, this 
paraphrasing seems legitimate.) 
Schaefer, in the recent study already 
cited, likewise failed to find evidence of 
temporal conditioning in his subjects— 
in fact, their intertrial responses tended 
decidedly to pile up (as in Fig. 3) to- 
ward the beginning of the intertrial 
interval rather than toward the end 
thereof. And on the basis of this find- 
ing, Schaefer advanced an explanation 
of intertrial avoidance learning which, 
while in effect something like the one 
put forward by Sidman, is at least dif- 
fently phrased. According to Schaefer’s 
conjecture, what the subject in an ex- 
periment of this kind does is to dis- 
criminate between two conditions: situ- 
ation with correct response being made 
and situation with this response not 
being made. And, since the rat is never 
shocked while thus responding and is 
shocked when responding differently of 
not at all, then the rat, feeling safe 
while responding correctly and fearful 
at all other times, is differentially rein- 
forced on each occurrence of the correct 
response. In other words, by making 2 
particular response, the animal, so to 
Say, converts the situation from a dan- 
gerous one to a safe one and is thereby 
rewarded. Here the dilemma mentioned 
earlier—that there is no explicit danger 
signal in a situation of this kind whose 
termination can provide the specific oc- 
casion for fear reduction and reinforce- 
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made by a rat during Session 3 of an experi- 
ment reported by Sidman (26). The abscissa 
represents seconds of time elapsing between 
the preceding response and the next one. 
When the subject delayed by as much as 20 
seconds between responses, it then received an 
electric shock of brief (0.2 sec.) duration. As 
will be noted, most responses occurred shortly 
after the preceding one, rather than piling up 
in the true “danger zone,” i.e, during the lat- 
ter part of the intertrial period (see text). 


ment of the avoidance response—is re- 
solved by making the situation as 4 
whole, without correct responding, the 
“danger signal,” which is then “turned 
off? whenever and while the animal 
makes the requisite response. 

This is an eminently satisfactory 
theory (and is stated more simply and 
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plausibly than Sidman’s somewhat 
equivalent one); but it cannot, alone, 
account for all the now known facts. 
In 1954, in a paper entitled “The Tem- 
poral Distribution of Avoidance Re- 
sponses,” Sidman reported that “after 
continued training” there is a tendency 
for a “well developed time-discrimina- 
tion” to appear (26, p. 401). In two 
recent papers, Boren and Sidman (2, 29) 
show that, if training is continued long 
enough (to or beyond approximately 
100 hours), then interval-avoidance re- 
sponses do indeed take on the expected 
temporal distribution, i.e., they tend to 
pile up near the remote end of the inter- 
trial interval, rather than toward the 
beginning thereof (see Solomon and 
Brush’s account of Milner’s work, 33, 
p. 266). Hence, even though the Sid- 
man-Schaefer type of explanation be 
valid for the results first obtained 
in interval-response avoidance learning, 
such an explanation must obviously be 
supplemented if one is also to account 
for the opposite type of temporal dis- 
tribution of responses obtained after 
more extended training. One possi- 
bility, therefore, is that, in experimenta- 
tion of the kind under discussion, the 
first discrimination that is developed is 
a comparatively gross one, i.e, between 
experimental situations with and with- 
out the correct response being in prog- 
ress, and that a second stage of dis- 
crimination later develops in which the 
subject discovers that situation-without- 
correct-responding is dangerous only 
after an interval of time has elapsed. 
There is, however, also the possibility 
that the stimulus-trace type of explana- 
tion can account for all the facts. Let 
us assume that fear is most strongly 
conditioned to a point on the stimulus 
trace 20 seconds removed from the event 
that sets up the trace (ie., the preced- 
ing correct response, either forced or 
spontaneous). Let us further assume, 
as we reasonably may, that fear (at least 
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in the beginning) generalizes far for- 
ward on the trace. If, then, as already 
posited, occurrence of the correct re- 
sponse during the intertrial period sets 
up a new trace and reduces fear, one 
might expect this response to occur just 
as soon as the subject starts being 
afraid. In other words, there is nothing 
in the nature of the situation which 
would require that the subject wait until 
maximally afraid before reacting defen- 
sively. Therefore, the temporal dis- 
tribution of the incidence of the correct 
overt response would not be expected 
necessarily to provide a faithful picture 
of the temporal pattern of fear intensity. 
Occurrence of the correct overt inter- 
trial response might very well pile up, 
not where the fear would be maximally 
intense, but rather where it is first ex- 
perienced. 

Eventually, of course, one would ex- 
pect discrimination to develop and to 
offset this original overgeneralization, so 
that fear would be first experienced 
during the intertrial period, not far for- 
ward (due to generalization along the 
trace), but instead toward the latter 
part of the period where the objective 
danger really is. 

In a study published in 1956 under 
the title “Time Discrimination and Be- 
havioral Interaction in a Free Operant 
Situation,” Sidman has shown that it is 
a simple matter to get rats to respond 
with considerable accuracy to a 20- 
second temporal interval when they are 
thirst-motivated and water-reinforced 
for pressing one bar, at the appropriate 
interval, after having pressed (on cue) 


another bar, Thus, as Sidman ob- 
serves: 


The procedure is an o 
Pavlovian trace conditioni 
fication that the “trace” 
exteroceptive stimulus bi 
own behavior. . 


perant analogue of 
ng, with the quali- 
is initiated not by an 
ut by the organism’s 
» ‘a The response probability 

age of time, not because 
nhibition [as Hull (10) 
ng], but rather because 


the situation becomes more like that pre- 


vailing at the time of reinforcement (28, p. 
469). 


At first it may appear strange that a 
free-operant type of behavior (bar press- 
ing) can be temporally conditioned so 
readily when the subjects are thirst-mo- 
tivated; whereas, as already noted, in 
an avoidance-learning situation the same 
free operant for a long time occurs much 
too soon, i.e., long before the time when 
the unconditioned stimulus will actually 
Occur. But there is an instructive differ- 
ence: in the situation when the subjects 
are thirst-motivated, responses that oc- 
cur too soon go unrewarded, i.e., do not 
produce water and are thus extinguished 
relatively rapidly; whereas, in the situa- 
tion when the subjects are fear-moti- 
vated, each and every occurrence of the 

“correct” response during the intertrial 
period, no matter how early in that 
period, produces a decrement jn fear 

\ ght) and thus tends to be 
self-reinforcing, rather than self-extin- 
guishing. Here, as in many other situa- 
tions, we see the importance of differ- 
entiating between Primary and second- 
ary drive and drive reduction. 


Two Dirrerent CONCEPTIONS oF 
AVERSIVE” STIMULATION 


In earlier sections of this paper, we 
have seen how numerous and determined 
the attempts have been to account for 
various forms of avoidance behavior 
without recourse to the concept of fear 
as an intervening variable. There is, 
of course, every justification in science 
for trying to push the principle of parsi- 
mony as far as possible. But it must be 
remembered that sometimes a given type 
of parsimony proves to be quite im- 
possible and has to be replaced by a 
more complex conceptual scheme. It is 
the present writers’ belief that a simple 
one-step (S-R) conception of behavior 
is inadequate and that, at the very least, 
we must posit a two-step S-r:s-R model, 
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where S and R, as in the simpler S-R 
scheme, are “objective” stimulation and 
response, but where, in addition, r and 
s are intervening variables, of which fear 
(as both a reaction and as a drive) is 
a prime example. 

The attempt of Sheffield to interpret 
the Brogden-Lipman-Culler experiment 
(previously cited) solely in terms of the 
contiguity (stimulus-substitution) prin- 
ciple and thus to eliminate the necessity 
of positing fear as an intervening varia- 
ble illustrates the difficulties which such 
efforts commonly encounter. Says Shef- 
field: 


The findings for avoidable shock offer no 
evidence for a strengthening effect of avoid- 
ance and follow the conventional result in 
Pavlovian experiments. Reinforcement by 
shock strengthened conditioned running; omis- 
oe of shock led to extinction (24, pp. 174- 
175). 


The point can be illustrated by refer- 
ring back to Fig. 1. There it will be 
seen that, in Sessions 2-T and 3-T, in 
each of the two longest runs of interval- 
responding, there is, indeed, a tendency 
for this behavior to extinguish (as evi- 
denced by the flattening of the cumu- 
lative response curve) and to have to 
be eventually reinforced by shock pres- 
entation. Here, it might seem, is clear 
evidence that shock avoidance is not 
reinforcing and that the response of 
running is indeed dependent upon the 
occasional occurrence of the shock. The 
difficulty, of course, is the one that al- 
ways arises in such a situation as a re- 
sult of speaking as if only one response 
were involved. The present writers as- 
sume that shock is, of course, occasion- 
ally necessary to reinforce the fear re- 
sponse, but that the running response is 
reinforced by fear reduction not by 
shock. It is our assumption that, when- 
ever enough fear is present to motivate 
intertrial running, there will be an ensu- 
ing experience of fear reduction, with the 
result that this behavior is always reim- 


forced. It is, rather, the underlying 
fear that extinguishes and has to be 
eventually reinforced by presentation of 
shock. Our assumption is that, if fear 
could be maintained at an appropriate 
intensity by some other means, the in- 
tertrial running would never extinguish. 
This assumption is supported by the 
recently reported finding of Sidman, 
Herrnstein, and Conrad that: 


The effect of occasional free shocks, delivered 
independently of the monkeys’ behavior, was 
to increase the rate of avoidance responding. 
_.. The effect of the free shock persisted over 
as long as 300 experimental hours during 
which time no other shocks were delivered 


(30, p. 557). 


The authors’ interpretation of this ef- 
fect, in terms of Skinner’s theory of 
superstitious behavior, seems to us con- 
trived and nonparsimonious. 

In an unpublished paper, Sidman, 
Herrnstein, and Boren try—but, we be- 
lieve, unsuccessfully—to develop a con- 
ceptual scheme which will account for 
all forms of avoidance learning without 
recourse to the concept of fear as a 
motivating and reinforcing agent. In 
essence, the position is pristine behavior- 
ism and stems directly, as the authors 
acknowledge, from the empirical re- 
searches and methodological stance of 
B. F. Skinner. In his 1953 book, Sci- 
ence and Human Behavior, Skinner 
states his position in this connection 
with stark simplicity. The chapter on 
“Emotion” begins with this sentence: 
“The ‘emotions’ are excellent examples 
of the fictional causes to which we com- 
monly attribute behavior” (32, p. 160). 
Then the familiar, but highly debatable, 
assertion by William James is approv- 
ingly quoted to the effect 


that we feel sorry because we cry, angry be- 
cause we strike, afraid because we tremble, 
and not that we cry, strike, or tremble because 
we are sorry, angry, or fearful, as the case 
may be (32, p. 160). 
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And, later in the chapter, there is a 
section specifically headed “Emotions 
Are Not Causes.” 

Rarely are theoretical issues (in psy- 
chology) so beautifully explicit and 
“clean.” The present writers, along 
with many others, take the Position that 
the emotion of fear is causal; that a one- 
step (simple S-R) psychology will not 
work (17,21); that, as a minimum, two 
causal steps are necessary to account 
for behavior; and that fear, in the case 
of so-called avoidance behavior, is an 


essential intermediate cause or determi- 
nant. 


On the other hand, Skinner and those 
of like persuasion hold that this is not 
at all the case and that a “chaining in- 
terpretation,” not greatly different from 
the earlier Watsonian notion of reflex 
chaining, is “capable of handling most 
of the data and of extension to other 
findings, while at the same time main- 
taining a degree of parsimony and in- 
ternal consistency that we were unable 
to achieve by means of the alternative 
considerations” (unpublished man: 
by Sidman-Herrnstein-Boren), 

This is not the time or place to enter 
into a detailed critique of the chaining 
interpretation, but one particularly ques- 
tionable aspect thereof may be per- 
tinently noted, namely, the tendency to 
speak of stimuli which have been paired 
with punishment as being aversive, 
This, we believe, involves a logically in- 
defensible attempt to avoid the more 
likely Possibility that such stimuli, after 
conditioning, become aversive because, 
and only in the sense that, they now 
elicit fear and that it is this latter state, 

and not the eliciting stimulus, which 
provides the aversive, motivating ele- 


ment in the situation (cf. Solomon and 
Brush, 33, p. 245 ff.). 


uscript 


SUMMARY 


Tt is now empirically established (a) 
that responses which occur between 
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trials in a noxious conditioning situation 
occur progressively less frequently if 
trials occur ineluctably, i.e., at fixed in- 
tervals, and (b) that between-trial re- 
sponses occur progressively more fre- 
quently when the next trial is delayed 
each time such a response is made. 
There is, of course, no mystery as to 
why intertrial (spontaneous) responses 
disappear in the former situation: they 
accomplish nothing and, if they happen 
to occur late in the intertrial interval, the 
ensuing noxious stimulus acts, in effect, 
as a punishment. 

But there has been considerably less 
agreement as to how it is, under the 
second set of conditions mentioned, that 
intertrial responses are reinforced and 
Perpetuated. A number of writers have 
taken the position that here the inter- 
trial response (made to the experimental 
situation as a whole, rather than to a 
specific danger signal) js just the ante- 
dating (classically conditioned) form 
of the response made to the Noxious, un- 
conditioned stimulus on the trial. In 
the present paper various considerations 
are cited which indicate that this ex- 
planation is too simple and that a more 
complex explanation, involving the in- 
tervening advents of fear arousal and 
fear reduction, is necessary and gen- 
erally sufficient, 

Intertrial avoidance learning, as a 
laboratory Phenomenon, deserves con- 
tinued study, both because of its in- 
trinsic theoretical interest and because 
of its rather patent relationship to com- 
pulsive (neurotically repetitive) be- 
havior and to the Psychology of work. 
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The study of cumulated jnd scales 
began with F echner; Fechner’s law is 
such a scale, Psychophysicists have 
been deriving such scales and com- 
Paring them with scales derived in 
other ways, notably by fractionation, 
ever since, and a lot of controversy 
has resulted. The controversy js 
particularly hot at Present because 
Stevens and Galanter (16) and Stevens 
(14) have assembled a lot of data 
which indicate that cumulated jnd 
scales do not agree with magnitude 
scales derived by other methods for 
intensity continua such as loudness, 
brightness, and pain, 

Unfortunately, Fechner's proce- 
dure for cumulating jnds, which has 
been widely defended but not widely 
applied since his day, rests on an as- 
sumption which is inconsistent with 
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versity. 
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help at various stages in t 
the paper, 


one of his definitions. This means 
that cumulated jnd scales developed 
by his procedure are incorrect, and 


other kinds of scales are meaningless. 

This paper begins by showing that 
Fechner's method contains internal 
contradictions for all but a few special 
Cases, and that it cannot be rescued 
It goes on to 


; hematically ap- 
Propriate method for cumulating jnd’s. 


This method turns out to be the sim- 


top of each other, like a stack of 
Unfortunately, the detailed 
mathematical equivalent of this very 
simple operation is often fairly com- 


rent controversy about 
ods in Psychophysics, 


The model ofa Sensation scale, The 


psychophysical Model of 
scale is q Mathem 
sensation scale is an 


scaling meth- 


y 
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The model of a sensation scale goes 
as follows. Corresponding to many 
of the major subjective dimensions of 
change of sensory experience, there 
are primary physical dimensions of 
change (e.g., pitch and frequency, 
loudness and amplitude, etc.). Once 
parametric conditions for significant 
variables have been specified, we 
assume that a single-valued, mono- 
tonic, everywhere differentiable 
(smooth) function exists that relates 
the subjective dimension to its cor- 
responding physical dimension. 
From here on, we shall use the words 
“dimension” and ‘“‘continuum’”’ inter- 
changeably; we shall usually talk 
about a stimulus continuum and its 
corresponding sensory continuum. 

That is the model, and it is very 
easy to state. The big difficulty 
comes when we try to decide how to 
fit data to it. All methods for doing 
this must introduce definitions and as- 
sumptions beyond those listed in the 
previous paragraph. These differ 
from one method to another. 

The oldest sensory scaling method, 
Fechner's, is based upon a further 
condition that says that any jnd on a 
given sensory continuum is subject- 
ively equivalent to any other jnd on 
that continuum. Whether this added 
condition is to be interpreted as 
merely a definition of the scale under 
consideration or as an assumption is 
a matter of opinion. Textbooks usu- 
ally say Fechner “assumed” that all 
jnd’s for a sensory dimension are 
equal to one another (1). It is not 
easy to know what he had in mind, but 
judging by his writings he probably 
did view it as an assumption having 
implications beyond scale construc- 
tion. Since it is not directly observ- 
able and since its indirect conse- 
quences are highly debatable, others 
since Fechner have suggested that it 
might better be viewed as a definition. 
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It is our view that this is the more 
sensible position; it certainly is the 
one for us to take in this paper since 
our point is a logical one, not a sub- 
stantive one. If equality of jnd’s is 
taken as a definition, then it cannot be 
proved or disproved by any kind of 
empirical evidence. An experiment, 
for instance, that showed that a tone 
20 jnd’s loud is not half as loud (ac- 
cording to fractionation judgments) 
as a tone 40 jnd’s loud would not 
have any relevance to what we may 
call Fechner’s definition ; it would only 
show that the kind of sensation scale 
implied by his definition does not 
agree with the kind implied by the 
definitions used in fractionation ex- 
periments. The issue, then, becomes 
what are the different scales useful for, 
and what is their relationship one to 
another. The latter part of this 
paper touches briefly on this problem. 

The main purpose of the paper is to 
explore the consequences of Fechner’s 
definition. Therefore, we must be 
certain of its meaning. It clearly 
does not mean, for instance, that all 
jnd’s for loudness contain the same 
number of physical units. It is only 
on the sensory continuum, not on the 
stimulus continuum, that jnd’s are 
defined as equal to one another. 
Furthermore, this definition holds 
only if all stimulus properties except 
those on the primary stimulus con- 
tinuum remain constant. So there 
js no reason, for instance, to expect 
that the subjective size of a loudness 
jnd at 1,000 cycles per second (cps) 
should be the same as that of a loud- 
ness jnd at 4,000 cps. Of course, it 
would be pleasant if they were equal 
in size, but the model does not require 
it. 
From here on we will talk about 
two kinds of jnd's. A sensation jnd 
is the magnitude of a jnd as measured 
in the units of the appropriate sensa- 
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tion continuum. By definition, all 
sensation jnd’s for a given sensation 
continuum are equal to one another, 
given unchanged values of all stimu- 
lus properties except those on the pri- 
mary stimulus continuum. A stimu- 
lus jnd is the magnitude of the 
change on the primary stimulus con- 
tinuum, measured in appropriate 
physical units, which is just sufficient 
to produce a change of one sensation 
jnd upward at that point. ( A dis- 
cussion of the essentially statistical 
nature of jnd’s appears later in this 
paper.) In general, stimulus jnd’s 
will have different sizes at different 
points on the primary stimulus con- 
tinuum. The rest of this paper will 
not be intelligible unless you keep the 
distinction between these two kinds 
of jnd’s in mind. 

We have assumed that jnd’s are 
measured upward on the stimulus 
continuum. They could also be meas- 
ured downward, and the possibility 
exists that the two measurements 
might not agree. In fact, they are 
certain not to agree if the distance 
spanned is more than two jnd’s, and 
if the size of the jnd at the end where 
measurement starts is used as the 
unit of measurement, since this means 
that the size of the measurement unit 
will be different depending on direc- 
tion of measurement. However, such 
discrepancies might exist in the meas- 
urement of a single jnd; this, if it 
happened, would mean that jnd’s are 
not suitable units of measurement 
unless direction is specified. We have 

therefore confined ourselves to upward 
jnd's. 

Now we can say exactly what this 
paper is about. Given a function 
(obtained from experiment, theory, 
or both) relating stimulus to sensation 
jnd’s for all points of the primary 
stimulus continuum, what may we in- 


fer about the sensory scale implied by 
that jnd function? 

Fechner's derivation of Fechner’s 
law. On October 22, 1850, Fechner 
(2) thought up the first (incorrect) 
answer to the question which ended 
the previous paragraph. Let us call 
any function that gives the size of a 
stimulus jnd at each point of the 
stimulus continuum a Weber function 
(corresponding to “a function relating 
stimulus to sensation jnd’s’’ of the 
Previous section), and any one-to-one 
function based on cumulated jnd’s 
which relates the stimulus continuum 
to a sensory scale a Fechner function 
(corresponding to “a sensation scale” 
of the previous section). These def- 
initions do mot restrict our attention 
to those two special functions which 
have come to be known in psycho- 
physics as Weber's law and F echner’s 
law! Fechner believed that the 
Fechner function corresponding to 
any Weber function could be expressed 
as the solution (integral) of a first 


» Ax, divided by the 
1 € stimulus at that oint, x 
1S a constant (Ax/x = k). E “us 
examine his argument, 
eber's law is true, th i 

er's Jay » then, since 
5 sensation jnd’s are equal by defini- 
ton, there is g constant A such that 
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How did Fechner make this step from 
differences (deltas) to differentials? 
He used what he called a “mathema- 
tical auxiliary principle,” the essence 
of which is that what is true for differ- 
ences as small as jnd’s ought also to be 
true for all smaller differences and so 
true in the limit as they approach 
zero (differentials). If this argument 
were acceptable (which it is not), the 
rest would be simple. Equation 2, 
when integrated, yields the familiar 
logarithmic relationship between sen- 
sation and stimulus which is known as 
Fechner’s law. 

Fechner thought that his general 
procedure ought to be applicable to 
any Weber function, not just to 
Weber's law. It is not. Except for 
a few special cases like Weber’s law, 
the definition of sensation jnd’s as 
equal and the “mathematical auxili- 
ary principle” are mutually contra- 
dictory. For example, consider the 
Weber function Ax/x* = k. Then, 
following Fechner’s procedure, we 
should write: 


Mu A mdo Mat 
Tere andso 4. = 3 
Integrating, we get 
i E 
a 


Let us now check to see whether this 
new Fechner function satisfies the 
definition which says that sensation 
jnd’s are equal to one another. If we 
are at point x on the stimulus contin- 
uum, a stimulus jnd, according to the 
Weber function used in this example, 
is kx. The sensation increment cor- 
responding to this change, the sensa- 
tion jnd at this point, is therefore 


given by: 
ule + kx?) — ule) 


A R 
= Be kx) a x 


pict 
“414+ kx 


which is clearly not a constant for any 
value of the constant A except zero. 

This, although only one example, is 
typical in the sense that almost any 
example you could think of would 
show the same discrepancy. Only for 
a very few Weber functions—some 
pathological ones, Weber's law, and 
its generalization Av = kx + c—does 
the ‘mathematical auxiliary princi- 
ple” yield a Fechner function with 
equal jnd’s. We will not take space 
to prove this formally, but a formal 
proof is available. 

The functional equation solution. 
We have shown that Fechner’s pro- 
cedure involves a self-contradiction. 
We shall show later that it leads to 
wrong results in all important cases 
except Weber's law. Obviously the 
“mathematical auxiliary principle” 
is wrong and must go. 

How, then, should we cumulate 
jnd’s? The simplest, most obvious 
procedure (which has very often been 
used exactly because it is simplest and 
most obvious) is simply to add them 
up one at a time. If the first jnd on 
a primary stimulus continuum is 
stimulus units, then two points on our 
cumulated jnd scale should be 0, 0 and 
1, 5, where the first number is the 
scale value on the y axis and the 
second number is the corresponding 
stimulus value on the x axis. If we 
then find that the size of the stimulus 
jnd at 5 on the stimulus continuum is 
is 8, then the third point is 2, 13. If 
we find that the size of the stimulus 
jnd at 13 is 10, then the fourth point 
js 3, 23, and so on. 

Fechner and some of his more 
modern imitators went way out of 
their way to avoid this simple and 
sensible procedure; in retrospect it is 
hard to decide why they did so. At 
any rate, the next two sections of this 
paper will develop a formal mathema- 
tical solution to Fechner's mathema- 
tical problem—a solution which turns 
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out to be the mathematical equivalent 
of the simple graphical or arithmetic 
technique discussed in the previous 
paragraph. The mathematical prob- 
lem centers about how to fill in the 
curve between the discrete points 
arrived at by the graphical method. 

What mathematical tools can we 
use to replace Fechner’s ‘“‘mathemati- 
cal auxiliary principle”? Equation 1, 
and the corresponding ones based on 
other Weber functions, can be solved 
directly without any mathematical 
auxiliary principles or other further 
assumptions. They are examples of 
what mathematicians call functional 
equations. The papers on which 
most of our discussion is based (5, 6) 
were published in the 1880s, twenty 
years after Fechner first published 
his work. 

The kind of functional equation 
implied by the definition of equality of 
sensation jnd’s is soluble for a very 
wide class of Weber functions. Un- 
fortunately, there is an infinity of 
inherently different solutions to each 
of these equations. However, further 
consideration of what we mean by a 
sensation scale will lead us to proper- 
ties which we usually take for granted 
and which are enough to narrow the 
solutions down to just one interval 
scale, unique except for its zero point 
and unit of measurement. It is inter- 
esting that in the case of the linear 
generalization of Weber's law, and in 
that case only, the f unctional-equation 
solution is the same as that obtained 
by Fechner’s auxiliary principle; for 
all other Weber functions the two 

solutions are different, 

First, we will state 

mathematical problem and its solu- 
tion. Let x, x> 0, denote a typical 
value of the stimulus continuum, and 
let u denote the (unknown) Fechner 
function. Let g be the (given) Weber 
function; i.e., a stimulus Magnitude 


the general 


Y, Y> x, is detected as larger (in a 
statistical sense) than x if y > x +g 
(x), whereas it is not discriminated as 
different from x if x <y <x + g(x). 
We write x + g(x) = f(x). By def- 
inition, a sensory jnd at the sensation 
u(x) is given by the increment? 


ulf (x) ] — u(x) 


(In the usual “delta” notation, g(x) 
= Ax and uff(x)] — u(x) = Au.) 
The condition that sensation jnd’s be 
equal simply means that all sensation 
jnd’s are a constant, which we may 
take to be 1 for convenience, since an 
arbitrary change of unit does not 
matter. Thus, we have our major 
mathematical problem: 


Find those real-valued differ- 
entiable functions u, defined for 
all x > 0, such that ulf(x)] — 
u(x) = 1, for all x0; 


Note that we have said those func- 
tions, not that function, for there may 
be more than one such function. 
This uniqueness question has not 
traditionally been raised, for so long 


as the problem was formulated in 


terms of linear differential equations, 
the uniqueness theorems of that 


branch of mathematics insured only 
one solution. In the realm of func- 


tional equations, we have no such 
assurances, 


It is very lucky that the functional 
equation which has arisen in this prob- 
lem is one of the more famous in the 


* Throughout this paper we shall have to 
use functions of functions. In general, if v 
and w are two real-valued functions of a real 
variable x, v[w(x)] denotes the number ob- 
tained by calculating y = w(x) and then find- 
ing o(y). Clearly, the order of writing v and 
w Is material, for v[w(x)] does not generally 
equal w[v(x) J, Consider, for example, v(x) 
= ax, where a + 1, and w(x) = x?. Then, 


oLew (x) ] = v(x) = ax?, whereas wlo(x)] = 
wax) = ary2, 
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literature; it is called Abel’s equation.’ 
The principal results we shall need 
concerning this equation were pre- 
sented by Koenigs (5, 6) in 1884 and 
1885.4 First, we will present his 
uniqueness results, which illustrate the 
method of attack and lead up to the 
general solution. Suppose that wo(x) 
is a solution to Abel's equation, and 
suppose p(x) is an arbitrary periodic 
function with period 1—in other words, 
any function satisfying 


p(x +1) = p(x) 


K sin 2nx is periodic with period 1, 
and so is an example of a function 
p(x). It is easly to show that the 
function up(x) = wo(x) + pLuo(x)] is 
also a solution to Abel's equation: 


atpLf(x)] = wolf (x) +b (mol) J} 
= 1+u(x) +p +u(x)] 
= 1+10(x)+hLe0(x)] 
= 1+u(x) 


Furthermore, it can be shown that if 
u and u* are two solutions to Abel’s 
equation, then there exists a periodic 
function p with period 1 such that 


u(x) = u* (x) + ple (x)] 


Thus, if we have any solution #o to 
our problem and if we choose p to be a 
differentiable periodic function with 
period 1, then p = %o + p (ato) is also 
differentiable and solves the problem. 

In: the case of Weber's law, we 
have f(x) = kx, k>1, and the dif- 
ferentiable function u(x) = jog k is 


3 Sometimes this equation is spoken of as 
the ‘Abel-Schroder equation, but more often 
Abel’s name is attached to this equation an 
Schroder’s name to the equation off) J = 
cv(x), which arises from Abel's equation 
through the sub: 

4 We are indebtec 


easily shown to satisfy the condition 
of equal sensation jnd’s. Therefore, 


logs , , (logs 
log k log k 


is also a solution if p is differentiable 
and periodic with period 1. 

There is an infinity of such func- 
tions p, and so in infinity of different 
solutions to the problem for any 
Weber function, including Weber’s 
law. This, of course, is quite unsatis- 
factory ; later on we will show that one 
of the properties that we usually at- 
tribute to jnd’s, and which as yet we 
have not used, enables us to insure a 
unique solution. However, first it 
will be useful to present Koenigs’s re- 
sults on the existence of solutions to 
Abel’s equation. 

The existence of solutions to Abel's 
equation. In psychophysical prob- 
lems, there is always @ threshold 
R > 0, such that g(x) is not observ- 
able in the range 0 < x < R. Thus, 
it is only a matter of convenience what 
we assume about the behavior of g 
near 0; we shall suppose that 


g(0) =0 and 0< g'(0) < © 


where g'(x) = a It is known also 


from experimental work that g is 
never 0 and that on the whole it will 
ase with x, except for limited 
ranges of some stimuli, where it may 
decrease slowly. With little or no 
loss of generality, we may suppose it 
never decreases sO rapidly as to have a 
In other words, 


incre 


slope less than —1. 
we also assume: 


g(x) >0 and g(x) >—1 for x>0 


From these assumptions, it follows 
that f(x) = x + g(x)has these prop- 
erties: 
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if strictly monotonic in x, i.e., if 
sigs y, then f(x) < fo); 0 is the 
only fixed point of f (x isa fixed 
point if f(x) = x); and 1 <f’(0) 
<L 0; 


The strict monotonicity of f implies 
that there exists an inverse function 
f~, i.e., a function such that 

PFO E)] = x = fi @)] 


It is easy to show that: 


f~ is strictly monotonic increas- 
ing, « is a fixed point of f- if and 
only if x = 0, 0 < f” (0) < 1. 


Observe that if we know a solution v 
to the equation 


Wf @]=t1+%) [3] 
then « =— v is a solution to 
uLf(e)] = 1 + w(x) [4] 


So it will suffice to deal with mpm ol ty 
in addition to the three properties 
mentioned, f~! is analytic, i.e., if there 
exist constants a; such that 


f(x) = LX ai, 
i=0 

then Koenigs has shown that a differ- 
entiable solution exists to Abel's 
equation. In applicati 
ity is no real restriction. 
ity of notation, let us den 
then Koenigs’ theorem ( 
easy to prove) may be expressed as 
follows: Let h™ denote the 7" iterate 
of h (ie. h™ (x) is the result of n 
successive applications of h 


ons, analytic- 
For simplic- 
ote f by h; 
which is not 


beginning 
at the point x), and let 
h(g) 
x) = lim =—— ZL 
ETOT 


then ¢ exists and is differentiable, and 


— log o(x) 
ea T w (0) 
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is a solution to Abel's Equation 3. 
Therefore, since h’(0) = 1/f’(0), 


a — log g(x) 
MO) = ig 70) 
is a solution to Equation 4 and so to 
our problem. 
The difficult Part of the proof is to 
show that the limit exists, Assuming 
that it does, it is easy to show that 


atts solution. Since hEf(x)] = x, 


. hm 
= {tog lim Tot [or (0) 


og da lim 
AO svn 


haD (x) 9 
wom] / ro 


The evaluation 
for ¢ is rarely a sim 
ple task, Fu 
thermore, the Conditions ich 
t j u 
t has been shown P 


vide a solution to Ab 


ances in whic - 
rone exist For example, T A 
P Ha = ax b xe 1, fails to Satisfy 
1 r yet by direct verifi 

ion one can show th ss 


u(x) = log log [an -1,.9 


log b 
satisfies uola) 


function F(x) ie 


x 

Meet the same 2° also fails to 
; onditio; 

tion n, but a 
Prana exists in this eae A 
found whic offer fun tions can be 
data and w ieee Somate empirical 
ditions, by it 1a. the assumed con- 


e€ seen 
can be evaluated. 
+ 
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The difficulty is, first, in inverting f, 
and second, in finding a simple ex- 
pression for kh. Since this is gener- 
ally difficult, we doubt that the mathe- 
matics of this section will be useful to 
psychophysicists who want a non- 
graphic method for cumulating jnd’s. 
It should be pointed out again that 
for the empirically important Weber 
function g(x) = kx + c the solution is 
known: it is 
wee log (kx + ©) 
log (1 + k) 


A further definition of the sensation 
continuum. So far we have examined 
two formulations of Fechner’s prob- 
lem, both of which are unsatisfactory. 
The first, that of Fechner, contains an 
internal contradiction. The second, 
the functional equation formulation, 
we have shown can be solved. Un- 
fortunately, we have also shown that 
it has infinitely many families of 
different solutions, which is intoler- 
able. In this section we shall propose 
an addition to the second formulation 
which amounts to a method of sum- 
mating jnd’s. We shall show that if 
we demand a particular form of in- 
variance of distances measured in jn 
units, then there is a unique (except 
for zero and unit) sensation scale for 
each of a wide variety of Weber func- 
tions, and for Weber's law this sensa- 
tion scale is Fechner’s law. 

The common psychological custom 
for measuring distances in jnd’s be- 
tween two points is to use the size o. 
the jnd at the lower point as the unit 
of measurement. Although it is rarely 
if ever explicitly stated, it is certainly 
implicitly assumed that if the dis- 
tances ab and cd are both æ stimulus 
jnd’s in length, then they have an 
equal number, say K(a), of sensation 
jnd’s. As a formal mathematical 
condition, this states that 


ulx + ag(%)] — u(x) = Ke) 
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where K is some fixed, but unknown, 
function of œ. It can be shown, first, 
that if u is a solution to this problem, 
dy. 
g(x) 
given by Fechner, but, second, that 
there are no solutions except when 
g(x) = 6x (Weber’s law). We will 
not present a proof of this result since 
it is a blind alley, but we believe that 
it suggests that this customary meas- 
urement of distances should be aban- 


doned. 

We must now 
distances really should 
If x and y are more than one jnd 
apart, we may expect the size of the 
jnd to change as we go from « to y- 
That fact should be taken into account 
in using jnd’s as units of measure- 
ment; failure to take it into account is 
what makes Fechner’s auxiliary prin- 
ciple and the standard measuring pro- 
cedure unacceptable. We shall’ pro- 
ceed to formulate this more sensible 
method of using jnd’s as measuring 
units. 

Let f(x) =x + g(x); then the point 
f(x) is one x-jnd larger than x. The 
point fUf@)J = f® (x) is one f(x)-jnd 
larger than f(x). In general f™ (x) is 
one f" (x)-jnd larger than the point 
for) (x). Clearly, for y > X, we can 
find some integer 7” such that 


fora) S y IE 


then it must be the integral 


consider how such 
be measured. 


and it is reasonable to say that y is 
between 7 and ” + 1 jnd’s larger than 
x, For the moment, let us suppose 
that y was chosen so that y = f™ (x), 
then we can say y is exactly ” jnd's 
larger than x. It seems plausible to 
require that the same be true of the 
sensory continuum, i.e., 


ult” (x) J — u(x) =n 


In words, we are saying that if point 
y is 20 stimulus jnd’s higher than 
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point x on the stimulus continuum, 
then it must also be 20 sensation 
jnd’s higher than point x on the sensa- 
tion continuum. If the above condi- 
tion is met for n = 1 (in other words, 
if all sensation jnd’s for a given sensory 
continuum are equal), then it must 
also be met for all larger values of z, 
since 


ul f (x)] — u(x) 


= m fD (x) ]} — u(x) 
= 1+ uff" (x)] — u(x) 


=n 


But this takes care of relatively few 
points, and does not allow us to say 
exactly how many jnd’s y is from x un- 
less the difference is a whole number of 
jnd’s. We must find a definition 
which tells us how to subdivide a jnd 
into fractional parts. How to do this 
is not obvious, since the definition of 
distances given above involves iterates 
of f, and these are apparently defined 
only for integers, Fortunately, it is 
possible to generalize the notion of 
an iterate to arbitrary, rather than 
integral, indices. This problem is 
closely related to that of Abel’s func- 
tional equation which Koenigs ex- 


amined; we shall be able to use his 
results, 


First, we can set up some properties 
that a generalized iterate f(x), 
where £ is any non-negative number, 
should meet. In essence, they 
amount to stipulating that f(x) 
should coincide with the usual defini- 
tion when £ is an integer and that the 
same law of composition should hold. 
Formally, it is sufficient to require 


that 
JOR) = x, FR) = f(x) 


and for every s and t> 0, 


LO) = FOF Oe] 
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For integers, the generalized iterate 
coincides with the usual notion, as you 
can see, by repeatedly applying the 
last condition to the second one. 

We have already presented a result 
of Koenigs which showed that if f is 
strictly monotonic and analytic, 1 < 
f'(0) < œ, and 0 is the only fixed 
point of f, then there exists a function 
$ defined in terms of the iterates of 
f> such that 


log (x) 
log f’(0) 
is a basic solution to Abel’s equation. 


This means that ¢ is itself a solution 
to what is called Schroder’s equation 


of (x) ] = F O)o(e) 
which is obtained from Abel's by tak- 
ing exponentials on both sides. Using 
this fact and following Koenigs, it is 


easy to show that ¢~ exists and that 
the function 


FO%) = eE (0) J'e (x)} 


satisfies the three conditions of a gen- 


eralized iterate. We show the latter, 
First, 


u(x) = 


SC) = $[6(w)] = x 


Second, 


FO (x) = Lf’ (0)d(x)] 
And so, using the fact that ¢ satisfies 
Schroder’s equation, 
Lf (x)] = f'0)6(x) = olf (x)] 
Hence, 
F (x) = f(x) 
Finally, 


FOO] 
ORCO LIOA 
=O MO OTE 0)]'e()} 
=P LF (0) P49 (x) } 
=f (x) 
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So, with this definition of the gen- 
eralized iterate, we can generalize the 
above definition of distances in jnd’s 
to prescribe how to deal with fractional 
jnd's. 

We reformulate our major mathe- 
matical problem: 


Given a Weber function g 
which is analytic, g'(x) >—1 
for all x > 0, g’(0) > 0, and g(0) 
= 0, to find those functions u(x) 
such that u[f‘(x)] — u(x) = t, 
for allx > 0, and all ¢ > 0, where 
f® is the generalized iterate of 


f(x) = x + g(x). 


Note that by setting ¢ = 1, this con- 
dition implies the equality of sensation 

jnd’s. 
À log ¢(x) 
First, we show that uo(x) = 4779) 
o(x) jog /”(0) 


solves the reformulated problem: 


uf (x) ] — wole) 


_ log off (x)] _ log $(x) 
~ log f’(0) log f’(0) 
_ log {Lf’(0) Jo(«)} — log (x) 
log f’ 0) 
_ tlog f’(0) 
log f’ (0) 
=t 


Second, from the results about Abel's 
equation, we know that if there are 
any other solutions to this problem, 
they must be of the form tp = o + 
p(uo), where p is periodic with period 
1. For wtp actually to solve the re- 
formulated problem, it is necessary 


that for every t Z 0, 


his upLf (x) J — up(x) 
= wolf? (x)] + p{uoLf (x) J} 
— uo(x) — pluo(x) I 


= i 4 ple + uo(x)] — pE] 


Thus, for every t> 0, 
pLt + u0(x)] = pLvo(x)] 


That is, p must be periodic with every 
period £, and so pisa constant. Thus, 
up to an additive constant, wo is the 
unique function which solves our re- 
formulated problem. 

In nonmathematical language, in- 
troducing the method of measuring 
fractional jnd’s has enabled us to 
eliminate all solutions to Abel’s equa- 
tion save %o, thus cutting down the 
number of acceptable solutions from 
infinity to one. 

We conclude, therefore, that the 
condition stated in the reformulated 
problem constitutes an acceptable 
definition of a psychophysical sensa- 
tion continuum, in the sense that it 
yields a unique Fechner function for 
any reasonable Weber function. We 
also find that for Weber’s law this 
condition yields Fechner’s law. The 
solution of our reformulated problem 
may cause unhappiness because it 
is not the same as the integral “‘solu- 
tion” proposed by Fechner, except in 
the special case of the linear general- 
ization of Weber's law. However, 
we have already shown that the inte- 
gral “solution” contradicts the defini- 
tion of equal sensation jnd’s. 

It is sad that the integral is not the 
right solution, for its evaluation is 
often easy, and we fear that noworking 
psychophysicist will find in our math- 
ematics a tool for determing a sum- 
mated jnd scale any better or more 
efficient than the simple graphic pro- 
cedure of adding jnd’s up one at a 
time. 

The statistical nature of jnd's. So 
far we have sounded as though we 
were treating jnd’s as fixed quantities, 
although every psychophysicist knows 
that jnd’s are statistical fictions, de- 
fined by an arbitrarily chosen cutoff 


on a cumulative frequency curve. 
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However, we now show that our 
method of reducing the infinity of 
solutions to Abel’s equation to one is 
equivalent to treating jnd’s as just 
such statistical fictions. 

We start with the old, famous psy- 
chological rule of thumb: equally often 
noticed differences are equal, unless 
always or never noticed. We define 
P(y,x) as the probability that y is 
discriminated as larger thanx. N ow, 
this rule of thumb simply means that 
on the sensation continuum the func- 
tion P(y,«) is transformed in such a 
way that it no longer depends on x and 
y separately, but only on the differ- 
ence of their transformed values, 
Put another way, the subjective con- 
tinuum v is a strictly monotonic trans- 
formation of the stimulus continuum 
such that the probability that a 
change of 6 units on the sensation 
scale will be detected depends only 
upon 6, and not on the place at which 

6 begins or ends. 

Formally, if we are at a point x of 
the stimulus continuum, and there- 
fore at u(x) on the sensation scale, 
and if a stimulus y is presented such 
that u(y) = u(x) + 5, then the chance 
that y will be detected depends upon 
ô, but not on x. If we note that 


Y = aufa) + 5] 
then the condition is that 


PuLu(xe) + ô], x} = P(6) 


Our Problem is to decide under what 
Conditions this problem has a solution 
and what that solution is. To this 
end, we make the assumption that for 
each x, P (y,x) is a strictly monotonic 
increasing function of y. 

We show the following: If the 
above problem has a solution, then 
there exists a function F(x) such that 
PLF(x), x] is independent of Xi 
where f(x) is the tà iterate of f(x) 
previously defined. The function 


f(«) — x is a Weber function natu- 
rally defined in terms of P. If there 
is a solution, it is unique and it is the 
solution o to Abel’s equation uf (x) ] 
— u(x) = 1. Inother words, if there 
is any solution to the problem of 
equally often noticed differences being 
equal, then it is unique and it is the 
solution to our proposed reformula- 
tion of Fechner’s problem. 

The proof is comparatively simple 
and runs as follows. Suppose there 
exists a solution u to the condition 
that P{u-[u(x) + 6],x} is inde- 
pendent of x for all 6>0. Since P 
is strictly montonic in y for all x; 
there is a unique solution to P(y,x) 
=k for each k, 0 <k <1: call it 
yY =fk(x). For any ô, let k = P(5), 
and so by our assumption u 


must 
satisfy 


u[u (se) +] = fx) 


where we have written ts 
Applying u to this, we have 


uLfs(x)] — u(x) = è 
Let f = fı. We observe that if = 0, 


then fo(x) = x, Suppose we choose 
any ô, e> Oand let y = F(x); then 


è = ulfs(y)] — u(y) 


= uff fee) ]} — uLfe(x)] 
ulfelfe(x)]} — u(x) — e 


for fro. 


Thus, 
ulfolf-(x)]} — u(x) = 6 + e 
But, from above, 


UL fose(x)] — u(x) =S+.¢ 


so 


ul foe(x)] = ul fol fe (x) J} 
whence 
Fore(x) = falfe(x)] 


we have shown that fs must 
the three conditions of a gen- 


Thus, 
satisfy 
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eralized iterate of f, ie. fs =f for 
all 6, so a necessary condition for a sol- 


ution is that 
PEF (x), x] 


shall be independent of x. From the 
fact that u[fs(x)] — u) = 6 = 
ulf(x)] — u(x), it follows that the 
solution is uniqe and that it is the 
same as that given for our reformula- 
tion of Fechner’s problem. 

It probably is not obvious, but the 
point of this section extends beyond 
sensory psychophysics into the scaling 
procedures based on Thurstone’s law 
of comparative judgment. Case V of 
that law is based on the assumption 
that equally often noticed differences 
are equal unless always or never 
noticed. This fact has two interest- 
ing implications. The first and more 
obvious one is that these two appar- 
ently different branches of psycho- 
ological measurements are actually 
doing the same thing (namely, using 
a measure of confusion as a unit of 
measurement by assuming that con- 
fusion is equal at all places on the sub- 
jective scale). The second, less obvi- 
ous implication is that perhaps sensory 
psychophysics can profit by consider- 
ing, as Thurstone and his followers 
have, scaling methods with less rigid 
assumptions which nevertheless are 
based on confusability data. One of 
us (Luce) will pursue this possibility 
further in a forthcoming book (7). 

Graphic methods for cumulating jnd’s. 
Psychophysical data do not come in 
mathematical form. In order to ap- 
ply our method for cumulating jnd’s 
(or Fechner’s, for that matter), it is 
necessary either to put the Weber 
function into equation form, or else 
to develop a graphic equivalent of the 
appropriate mathematical operations. 
The graphical equivalent of Fechner's 
technique is well known, although 
rarely used (see, &-8+ 15, pp. 94 and 


Fic. 1. How to cumulate jnd’s. The size 


of the jnd at the origin is marked off on the x 
axis to find point A, the size of the jnd at A is 
marked off to find point B, and soon. The 
stimulus values A, B,C, . ss correspond to 
the points 1, 2, 3, . - - om the cumulated jnd 


scale. 


147-148). It is, of course, wrong, 
since Fechner’s technique is wrong. 
If our technique is to be of greatest 
applicability, we should provide a 
graphic equivalent also. Unfortu- 
nately, it seems difficult to find a truly 
convenient one. The only method we 
know of is to go back to the basic idea 
of adding up jnd’s—the idea that one 
jnd plus one jnd is two jnd’s. The 
method of applying this basic idea is 
given in Figure 1, and was discussed 
earlier in the paper- Its error char- 
acteristics are abour the same as 
those of the graphic techniques of 
integration which have been used in 
the past. Unfortunately, the method 
is tedious; if there are 170 jnd’s 
between absolute threshold and the 
upper limit of discrimination, then 170 
separate operations are required to 
determine the cumulated-jnd_ scale. 
The errors in these successive oper- 
ations do not multiply, however. 
Practial effects of the new procedure. 
No doubt it is important to under- 
stand Fechner’s logical error and to 
know how to avoid it, but the burning 
question for working psychophysicists 
is: What, if anything, does this do to 
the currently accepted conclusions 
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about the uselessness of adding up 
jnd’s? 


First, it is easy to show that under 
some circumstances the difference 


betwean integration ail the fuie- 
tional-eqnation solution is substan- 
tial. Consider the glass of Weber 
funetlens gle) = axli: if a is greater 
than zero, the asymptotic error of 
the integral solution as # approaches 
infinity is infinite; while if e is less than 
Jor, the avytriptatie cerot ie zua. 
Of course, if e equals zero (Weber's 
law), the two procedures give identical 
results. The order of magnitude of 
the error for small numbers of jnd’s 
depends on the constants in the equas 
tion; it can be of significant size even 
if e is less than zero. One way of 
looking at it is that the integral solu- 
tion is the approximation given by 
the first two terms of a Taylor series 
expansion of the functional equation; 
all square and higher power terms of 
the expansion are omitted: 


ule + g(x)] — u(x) = 1 
= u(r) + 0'(«)g(e) 


ge)? 


+ w(x) oT Frey = H(i) 
= u'(x)g (x) 
MAT 


A number of 


An 3 experimental deter- 
minations of jnd's, Particularly for 
intensive continua, produce a curve of 
B(x) /x that first falls and then is flat 
=i function often well approximated 
by g(x) = ke + c, However, for 
some continua the picture is less sim- 
ple. ere are some (pitch, for ex- 
ample) where the curye appears to 
rise again at the high end. The fall- 
ing section of these curves corresponds 
to the case e < 0; the flat section cor- 
responds to the case e = 0; the rising 
section corresponds to the case e > 0. 


However, the x-axis of such graphs 
is usually plotted logarithmically. 


This means that the rising seetion may 
cover most of the range within which 


thé alliiulus can be varted—a tact 
which the logarithmic v-axis tends te 
conceal, Se it is quite possible that 
the error in using the integration tech- 
nique is substantial for many sense 
modalities and for large ranges within 
each. 

But the possibility of error is ir- 
relevant unless someone has actually 


made the error, Hag anyone? Ex: 


tensive examination of the literature 
Suggests that the answer is that not 


very many such érors have occurred. 
Some authors are quite unclear about 
how they added up jnd’s, but many of 
them have preferred the step-by-step 
method which corresponds to the 
functional-equation solution because 
it was very simple to do, How 
simple it is, of course, depends on the 
number of jnd’s to be added; we 
doubt very much if the jnd's for piteh 
will ever be added this way, since there 
are several thousand of them. We 
have found only one clear instance 
(15) in which the graphie equivalent 
of integration has been used (to cum- 
wate pitch jnd's, as it happens), 
thoug 1 it has been vigorously recom- 
mended. The general avoidance of 
the graphic equivalent of integration 
may be caused by shrewd intuition 
that something is wrong with Fech- 
ner's mathematical auxiliary princi- 
ple, Or it may simply be a rare 
instance in which the fear of mathe- 


matical complexity has benefited sci- 
ence, 


Do cumulated jnd's agree with other 


scales? The results of cumulating 
jnd's have often been compared with 
the results of other psychophysical 
procedures (4). The most common 
finding has been that the cumulated 
jnd scales do not ; gree with scales de- 


| 
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termined by fractionation or direct 
magnitude estimation, at least for 
such continua as loudness. A review 
Of This literature Might esem appropri- 
ate here, but it is quite innecessary, 
since the relation between scales based 
on confusion data (like cumulated 
jnd scales) and those based on frac- 
tionation or magnitude estimation 
has been extensively and excellently 
discussed in recent studies by Stevens 

(14), Stevens and Galanter (16), an 
Piéron (8, 9, 10). 

The controversy over the relation 
between cumulated jnd scales and 
scales determined by other methods is 
embedded in a larger, sometimes acri- 
monious controversy about the rela- 
tionships among various methods of 
sensory scaling. To some extent we 
shall have to enter the fray. 

; The first and most important ques- 
tion is this: Do the different scaling 
procedures, if properly used, lead to 
different scales? Unless we reject a 
great many experiments as improperly 
performed, we must answer "Yes 
But the issue is not as simple or un- 
ambiguous as that answer: Fer e~ 
ample, Garner (3) has developed a 
loudness scale based on both fraction- 


ation and multiseotion judgments that 
fits a large number of experimenta 
results in auditory psyehophysie 

better than does the old sone $ a 
(his paper was written prior to the 
development of the new sone scale 
13 ]), Figure 2 shows the relation- 
ship between that scale and a Cumt? 
lated jnd scale for loudness prepare! 
by us from Riesz’s data (11, 12), The 
two scales seem to be roughly linearly 
does it mean anything 


related—but t mean é 
for the controversy! Riesz's proce- 


dure has often been criticized, and his 
data are almost 30 years old. The 
form of Garner's scale (which is all 
that matters for this argument) 1s 
based primarily OP his multisection 


i — wn) = p f "i 0 
3 f f | 
i ZA tae 
L 5 a e i 
4 ae E Pe co 
a al . 


Pe 
Try a a do aa 18 00 IP aT "A Oe o o ma wo 
mama ect 


Fie. 2. Tho relation between Gamers 


loudness scale and Riesz’s cumulate j 
scale, The old sone scale (ASA loudness 


scale) and Stevens's recent revision of it are 


included for comparison. 


rather than his fractionation data: 
Scales based upon multisection data 
usually agree with thase constructed 
by confusability methods; the explan- 
ation proposed by critics of these 
methods is that the adjustment of five 
or six stimuli in a multisection experi- 
ment may produce confusion among 
the tones being adjusted. If this 
argument is correct, and if the form 
of Garnet's scale is based upon multi: 
me ti data, it a surprising that 
the two agree: HF reason for 50 
extensive a discussion ol Gattiét’s 
scale is that loudness is the central 
fatdleerouid of this controversy: H 
the verdict of psychophysica history 
ig that confusability and multisection 
anales give results different {rom frac- 
tionation results for loudness, then 
psychologists will amost certainly 
assume that the two procedures yield 
different results in other intensive (or, 
as Stevens calls them, Class I or 
prothetic) continua. Unfortunately, 
even in psychophysics, not enough uni- 
versally accepted data are available to 
settle the argument. 

If confusability scales and scales 
based upon fractionation or direct 
magnitude estimation agree, no prob- 


lem arises. If not (and we suspect 
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they will not), psychophysicists must 
still evaluate each kind of procedure 
and its resulting scale. Some psycho- 
physicists feel that fractionation and 
magnitude estimation have great face 
validity, and that confusability scales 
are distortions of the scales obtained 
by these procedures. ‘They say that 
fractionation and estimation scales 
correspond to what Ss say they feel, 
they are obtained by straightforward 
procedures rather than indirect ones, 
and, after all, what logic is there in 
basing a measure of magnitude on 
variance or “noise.” 

Other psychophysicists feel that 
confusability scaling is the better 
method. They say that fractionation 
and estimation data are unreliable, 
variable, and, as a rule, at least frac- 


tlonatlon data cannot be turned into 
seales unless obtained from a “good,” 
which means extensively trained, sub- 
ject. The estimation techniques have 
not been used enough times in enough 
Haces to indicate clearly what effect, 

ay, (tailing May have on the re- 
sults. Confusability scales can be 
obtained from untrained 58 who have 
fio idea what form of scale is wanted 
from them; they can even be obtained 
from animals. 

Each group asserts that its preferred 
scales are more nearly consistent with 
the bulk 9f peyehophyaieal data than 
the other kind of scales; each group 
can produce impressive arguments to 


buttress its elaim. 


till another position is possible: 
perhaps two different kinds pf sen: 


sory processes are being tapped by 
these two different kinds of proce- 
dures, If so, both kinds of scales are 
useful, but for different purposes, 
This could well be the eventual end- 
point af the argument, 
7 source o sian i 

i aote i the REUT oF it 


dividual differences: The custom has 


been to take means or medians, and 
recently a number of psychophysicists 
have raised vigorous questions about 
the appropriateness of doing so. 
W. J. McGill’ is currently attempting 
to find a better way of respecting in- 
dividual differences while still obtain- 
ing a “universal” scale. It will be 
interesting to see what light serious 
attempts to do justice to individual 
differences sheds on the differences 
between the two classes of scales. 

The status of cumulated jnd’s has 
been controversial for more than a 

undred years, and this paper is not 
intended as an attempt to settle the 
controversy. Our main point is that 
Fechner'’s problem has been improp- 
erly formulated and that the integral 
usually offered as a solution is not in 
fact a solution when the Weber func- 
tion differs from g(x) = kx 4+-¢. We 
haye also developed what appears to 

e the correct solution, only to find 
that ia computational worl it has 
usually been used in spite of its dis: 
agreement with the integral solution, 
This means that our clarification of 
the logical issues underlying Fech- 
ner’s formulation does little to change 
the status of the present, primarily 
empirical, controversy about scaling 
methods. However, one of us (Luce 
C7) has recently developed a way oj 
dealing with eotifusability data base 
on a simple axiom which, if it wars 
out eee ly, may resolve the 
difficulty by changing our ideas about 
the meaning of econfusability scales; 


this dsvelopment will ba described t 


another publication. 


SUMMARY 
Feshners method fer adding up 


jnst noticeable differences (ad's) i 


fain sensory seales is based oH a 


iat J: MeGill, Personal communieation 
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mathematical error: he used a differ- 
ential equation approximation to a 
functional equation instead of the 
functional equation itself. The func- 
tional equation Can, however, be 
solved directly- The solution coin- 
cides with the differential equation 
solution only in the special case in 
which the linear generalization of 
Weber's law holds exactly. The 
mathematical properties of the formal 
golution are such that it probably 
will not be very useful for practical 
computation, but the extremely sim- 
ple graphical procedure of adding up 
jnd's one at a time is the graphical 
equivalent of the mathematically 
correct solution. The amount of 
difference between the two proce- 
dures can be calculated for some 
special cases; its size depends on the 
form of the function relating size of 
jnid's to stimulus magnitude. 

This error does not seem to have 
any significant impact upon the con- 
lovelsy over the relation between 


cumulated jnd scales and scales ba: 
gn fractionation and direct estimation 


data because most peychophysicists 
have, in fact, ignored the recom- 
mended (incorrect) procedure and 
have stubbornly summated jnd’s in 
the obvious and correct way. 
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STRENGTH OF CARDIAC CONDITIONED RESPONSES WITH 
VARYING UNCONDITIONED STIMULUS DURATIONS + 
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University of Connecticut 


The effects of long and short shocks 
on the conditioning of fear have been 
frequently considered important tests of 
theory of how fear is learned (1, 2, 3, 4, 
9,10). N. E. Miller, for example, has 
stated that according to a strict drive- 
reduction theory of reinforcement “. . , 
other things equal, a signal followed by 
a brief noxious stimulus should acquire 
the capacity to elicit stronger fear than 
one followed by a prolonged noxious 
stimulus” (2, p. 375). Fig 
Despite its theoretical significance, 
there have been few direct experimental 
attacks on this problem. We have found 
only two published conditioning studies 
with duration of noxious US as a major 
variable, one by Bitterman, Reed and 
Krauskopf (1) and the other by Mowrer 
and Solomon (4). Each of these stud- 
ies examined only two values of shock 
duration over a relatively narrow range. 
More extensive data on this problem 
can be extracted from a series of cardiac 
conditioning experiments we have done 
exploring the effects of duration of shock 
on the form of the conditioned heart re- 


yet unreported, on t 
duration to stre 
of CR. 

If heart rate chan 
tion of a shock is a 
of fear, our classic 


ge during anticipa- 
ccepted as an index 
al conditioning stud- 


1This is Technical Report N 
Contract Nonr-631 (00) between 
Naval Research and the J 
necticut. Reproduction in 
is permitted for any purp 
States Government. 

2 Now at the University of Maryland, 


0. 15 under 
the Office of 
niversity of Con- 
Whole or in part 
ose of the United 


ies using shocks of 0.1, 2.0, 6.0, and 15 
sec. allow us to make a direct and 
thorough empirical check on Miller’s 
statement of the drive reduction view. 
We shall first summarize our own ex- 
periments. 


METHOD 
General 


The apparatus, procedure, and Ss have all 
been described in detail in previous papers 
(8, 9, 10). They are summarized as follows, 


Following 
nditioning 
trials consisting of the tone CS followed in 6 


Heart rate was measured with 


an electrocardi- 
ograph before, during, and aft 


er each trial, 
Measures of CR and UR 


Since individual differences in the form of 
conditioned cardiac responses occur, a measure 
of heart rate disturbance was chosen which 
would be comparable for the different types of 
response, This was a variability measure, the 
Statistical range of heart rates during the con- 
ditioning (posttone) period. 
derived as follows. 
conditioning trial eac! 
tween the tone and k 


rate measure. There were usually about eight 
or nine of these, The difference between the 
largest and smallest of them defined the range 


for that trial, a measure which we previously 
have designate, 


The same 
pute the Max, 
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after the shock onset are used instead of the 
eight or nine before. 


RESULTS 


Figure 1 presents the Maximum CR 
and Maximum UR for four groups hav- 
ing different shock durations. Each 
point is the mean of 15 Ss’ data (3 Ss, 
selected without bias, were omitted to 
equalize the Ws in the groups for statis- 
tical convenience). Maximum CR does 
not seem to vary systematically as a 
function of shock US duration. What 
fluctuations do occur are easily attribu- 
table to the corresponding variation in 
Maximum UR. This fact is established 
by Table 1, which gives the results of 
an analysis of covariance of the Maxi- 
mum CR function, using Maximum UR 
as the relevant covariate. When the in- 
fluence of the unconditioned response is 
extracted from the conditioned response 
there is left an insignificant amount of 
CR variation among the shock condi- 
tions. 

The absence of significant differences 
among the group means is not to be in- 
terpreted as due to absence of condition- 
ing. Evidence has already been pre- 
sented (8, 9, 10) to show that a sig- 
nificant amount of cardiac conditioning 


o 


a 


HEART RATE DISTURBANCE (B/M) 


n 


20 c3 r. 150 


DURATION OF SHOCK US (SEC) 


Fic. 1. The maximum amplitudes (range) 
of CR and UR are plotted as a function of 
shock US duration. One standard deviation 
is indicated by & vertical line on either side of 


each of the mean values comprising the points 


of the CR function. 


TABLE 1 


SUMMARY OF ANALYSIS OF COVARIANCE OF 
Maxmus CR AMPLITUDE WITH 
Maximum UR AMPLITUDE 


Source df M? F 
Between 
durations 3 766 -68 
Within 52 1131 


For 3 and 52 df, F is 2.79 for p = 5%. 
The variate-covariate correlation is -}.57. 


took place in all four groups, using other 
(correlated) measures of response. To 
demonstrate further that the CRs and 
URs in Fig. 1 represent reliable dis- 
turbances in rate, we have compared 
them with similar measures of pretone 
variability, i.e., the range of ten pretone 
rate measures. Table 2 presents the 
results of a series of ¢ tests of the sig- 
nificance of the differences between mean 
pretone range and mean posttone range 
(for CR), and between mean pretone 
range and postshock (for UR). Each £ 
is based on 15 differences, each differ- 
ence itself a mean (over 10 trials) for 
each of 15 Ss. Pretone means were 
small, less than 1.5 B/M for all groups, 
in comparison to the posttone and post- 
shock means shown in Fig. 1. 


Discussion 


We wish to discuss our results briefly 
in relation (a) to previous findings and 
(b) to Miller’s theoretical statement. 


TABLE 2 


PROBABILITY OF CHANCE OCCURRENCE OF MEAN 
F} DIFFERENCES BETWEEN PRETONE CARDIAC 
DISTURBANCE AND BOTH CR anp UR 
FOR THE Four DURATIONS 
or Sock US 


Shock Duration (Sec.) 


0.1 2.0 6.0 15 
CR 01 01 02 01 
UR 001 001 01 001 
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The flat gradient in Fig. 1 relating 
strength of CR to US duration is not 
inconsistent with the findings of Bitter- 
man, Reed, and Krauskopf (1), who 
found no significant difference in the 
strength of conditioned GSR with shock 
duration of .5 and 3 sec. This lack of 
difference, however, may have appeared 
because they used the same Ss for both 
durations. Their experiment employed 
different signals for short and long 
shocks; and, if the Ss discriminated the 
signals verbally, they were assumed not 
to have generalized autonomically (i.e., 
GSR). The weakness of this assump- 
tion is shown by the fact, reported by 
Notterman, Schoenfeld and Bersh (5), 
that Ss who state they expect no shocks 
during experimental extinction continue 
to show conditioned cardiac disturbance. 

Mowrer and Solomon (4) paired a 
light signal with shocks of 3 sec. for 
one group of rats and with shocks of 

10 sec. for another group. The strength 

of fear conditioned to the light was 

measured by the capacity of the light 
to act as effective punishment of a lever- 

Pressing response. For these two dura- 

tions of shock, an abrupt onset and 

termination of shock was employed. No 
significant difference between the two 
groups was found with Tespect to their 
inferred fear. Two further groups were 
run under a similar Procedure, except 
that shock was terminated gradually 
rather than abruptly. For one group 
the shock was 4 sec. long and for the 
other, 7 sec. No reliable difference in 
fear was reported between these groups 
either. Despite the wide differences in 
Ss, procedures, and methods of inferring 
fear, the results of the Mowrer and 
Solomon study are in agreement with 
our own findings. Such agreement gives 
us confidence that there is no relation- 
ship between indices of fear and noxious 
stimulus durations. 

The discrepancy between the empirical 
findings and Miller’s theoretical formu- 


lation of the drive-reduction view de- 
Serves comment. It will be noted that 
Miller prefaced his statement that short 
noxious stimuli would produce more fear 
than long ones with the phrase, “other 
things equal.” There are at least two 
important “other things” here; the first 
is strength of US, or more precisely, 
subjective strength. This amounts to 
a statement of the need to control the 
strength of the shock, or other noxious 
stimulus, as it feels to the subject. If, 
through temporal summation, the longer 
shocks are felt as stronger, then stronger 
fear might result from them despite the 
correctness of Miller’s statement, be- 
cause more intense unconditioned stimuli 
are known to create stronger CRs (6, 7), 
The second important factor in need of 
control is adaptation to shock. If there 
is a period of rapid adaptation during 
the early part of a prolonged shock, it 
might produce the major part of drive 
reduction necessary to condition fear. 
An effect of this kind shortly after shock 
onset could make the length of the physi- 
cal shock quite irrelevant to the timing 
of the drive reduction, 

With respect to the first point, tem- 
poral Summation, we have one way of 


e of the uncondi- 
Our results show that 
there were no Systematic differences in 
the magnitude of UR for the different 
shock durations, Hence, we have some 
evidence, at least, that the over-all sub- 
jective intensity of the US was rela- 
tively constant for the y, 


conditioned response 
as a relevant datum. It is true that 


\ 
Ww 


# 
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adaptation to the shock occurs if mo- 
mentary magnitude of the UR reveals 
subjective shock intensity. We have 
shown (8, 9, 10) that regardless of shock 
duration in the 0.1 — 15 sec. range the 
heart-rate response to shock takes the 
form of a rapid acceleration to a maxi- 
mum rate within 4 beats of shock on- 
set, followed by a slow irregular return 
(adaptation) to normal requiring more 
than 20 beats. If the drive property 
of shock parallels the course of UR mag- 
nitude in time, we would be led to say 
that drive reduction is remarkably un- 
related to shock duration in the range 
investigated. In terms of Miller’s ap- 
plication of Hull’s theory, this would 
mean that no differences in condition- 
ing of fear should occur because sub- 
jective shock intensity reduction (the 
presumed reinforcing agent) was equally 
delayed for all groups. If this line of 
reasoning is correct, our experiment un- 
fortunately provides no real test of a 
theory that predicts the consequences of 
varying delays of reinforcement. 

This difficulty, moreover, may not be 
restricted to the present experiment. As 
long as there is no demonstrable rela- 
tionship between objective and subjec- 
tive properties of shock, the possibility 
of a simple experimental test of Miller’s 
assertion is precluded. Use of a stronger 
shock (if Ss would endure long dura- 
tions) might conceivably eliminate the 
troublesome adaptation effect, but only 
at the cost of a corresponding increase 
in the likelihood of destroying our pres- 
ent control of temporal summation. 

The present data may be accounted 
for by two-factor or contiguity theories 
which stress the importance of the on- 
set rather than the offset of shock (3, 
4), but the data are not critical tests of 


this type of theory. 


SUMMARY 


p has been found in 


No relationshi ARET 


human Ss between conditione: 


rate response magnitude and a wide 
range of shock US durations. 

Under the assumption that cardiac 
disturbance is an index of fear, this fact 
is related to a deduction from a drive- 
reduction theory of fear, according to 
which a signal followed by a brief nox- 
ious stimulus should require the capacity 
to elicit stronger fear than one followed 
by a prolonged noxious stimulus. It is 
concluded that this proposition is either 
incorrect or not testable under the con- 


ditions provided. 
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STIMULUS AND RESPONSE GENERALIZATION: 


DEDUCTION OF THE GENERALIZATION 


GRADIENT FROM 


ROGER N. 


A TRACE MODEL? 


SHEPARD 


Harvard University? 


It is now generally acknowledged 
that (a) a response conditioned to one 
stimulus tends also to occur to other 
stimuli, and (6) the magnitude of this 
response-tendency (for any particular 
one of those stimuli) is governed by 
the dissimilarity between that stimu- 
lus and the stimulus to which the 
response was originally conditioned. 
Indeed, this principle of stimulus gen- 
eralization is of such fundamental 
importance that any quantitative 
theory of behavior that fails to deal 
with it explicitly can only be regarded 
as incomplete. It is not Surprising, 

therefore, that a number of investiga- 
tions have been specifically concerned 
with the form of the function relating 
generalized response-tendency to in- 
terstimulus dissimilarity, What may 
seem surprising, though, is that the 
conclusions of these various studies, 
far from converging on a unique func. 
tion, have diverged to the several 
different functions illustrated in Fig. 
1. Beyond those who doubt the 
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existence of a quantitatively invariant Ng 
“gradient of generalization” in the 
first place (5, 16, 21), there are those, 
like Schlosberg and Solomon, who 
consider this gradient to be linear, as 
in A (8, 14, 22, 23) ; those, like Spence, 
who consider it to be convex upward, 
as in B (27, 28); and those, like Hull, 
who consider it to be concave upward, 
as in C (1, 11, 12, 19, 20). In addi- 
tion, if the “discriminal dispersion” of 
the psychophysicists can be inter- 
preted as a gradient of generalization, 
there is support for a bell-shaped func. 
tion which is first convex and then 
concave upward as shown in D (9, pp. 
317-319), 

In addition to stimulus generaliza- 
tion, an analogous phenomenon of 
response generalization js sometimes 
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been conditioned tends also to evoke 
other responses, and (b) the magnitude 
of this tendency (for any particular 
one of those responses) is governed by 
the dissimilarity between that re- 
sponse and the response originally 
conditioned. Although this principle 
of response generalization is also of 
considerable theoretical importance, 
even less progress has been made to- 
ward the quantitative determination 
of the shape of the gradient in this 
case than in the case of stimulus gen- 
eralization. 

Actually, that a unique gradient of 
generalization has failed to emerge 
both in studies of stimulus generaliza- 
tion and in studies of response general- 
ization no longer seems so surprising 
when one examines these studies in 
detail. For although the obtained 
gradient must depend crucially upon 
the choice of the independent variable 
(i.e., dissimilarity), the various in- 
vestigators have never been able to 
agree on any one measure of dissimi- 
Furthermore, 
since tests for generalization have been 
conducted during various stages of 
discrimination learning, as well as 
after various periods of extinction, 
one must consider the possibility that 
the form of the gradient may change 

radically under different conditions of 
reinforcement (13, 31). 

This paper will attempt to show 
that, by approaching the generaliza- 
tion problem from a somewhat differ- 
ent direction, considerable evidence 
can be adduced for the proposition 
that the gradients of stimulus and re- 
sponse generalization both conform to 
an exponential decay function (Curve 
C of Fig. 1). Further evidence will 
be presented to indicate that the form 
of the gradient does indeed depend 
upon the schedule of reinforcement 
and, more particularly, that it changes 

from the exponential to a bell-shaped 


function (Curve D) as nonreinforced 
trials are introduced with greater and 
greater frequency. 


THE EXPONENTIAL GRADIENT IN 
ParreD-AssociaTE LEARNING 


In view of the difficulties attending 
any effort to establish one measure of 
dissimilarity as a standard, the follow- 
ing strategy was recently proposed : 
Stimuli (or responses) were con- 
ceptualized as points in a “psychologi- 
cal space” in such a way that the dis- 
tance between any pair of these points 
represented the psychological dissimi- 
larity between the corresponding pair 
of stimuli (or responses). By taking 
account of a set of metric axioms which 
any measure of distance should satisfy, 
it was shown that hypotheses about 
the shape of the gradient of generaliza- 
tion could be tested without resorting 
to an independent measure of dissimi- 
larity (25). In a series of experiments 
on stimulus and response generaliza- 
tion during paired-associate learning, 
substantial support was obtained for 
the hypothesis that generalization is 
an exponential decay function of psy- 
chological distance (26). Since alter- 
native hypotheses were not tested, 
however, it seems desirable to present 
data in a form that will clearly rule 
out the Functions A, B, and D of Fig. 
i 

Now in a paired-associate experi- 
ment there are V stimuli, Si, Sas e» 
Sy, to which are assigned JV responses, 
R;, Ro, . - - Ry. On each trial one of 
the N stimuli is presented to the sub- 
ject who must in turn produce one of 
the N responses. Various procedures 
of differential reinforcement can be 
used to communicate to the subject 
the prevailing assignment of the re- 
sponses to the stimuli. Sometimes 
the subject is simply informed after 
each trial whether his response was or 
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was not the correct one (ie, the one 
assigned to the stimulus presented on 
that trial) ; at other times more elabo- 
rate methods of “correction” are used 
(26). In any case the so-called as- 
signment is simply a rule which the 
experimenter follows in delivering the 
reinforcements, and any arbitrary 
one-to-one assignment can be estab- 
lished in this way. 

Consider then a paired-associate 
experiment in which the responses are 
highly distinctive and so lead to neg- 
ligible amounts of generalization. If 
there are N pairs consisting of one 
stimulus and its assigned response, 
the data from such an experiment can 
be represented by an N X N matrix 
giving, for every S; and Sp, the condi- 
‘tional probability P&S with which S; 
leads to the response assigned to S;, 

(25). A very basic measure of stimu- 
lus generalization between S; and S; is 
then provided by either of the prob- 

abilities Py® or Py, Comparison 
between different experiments is facil- 
itated, however, by adjusting these 
measures so that the generalization 

etween any stimulus and itself is 
always unity. This can be done by 
replacing the absolute probabilities 
Pas and P:S with the ratios Pas/ 
Pi® and Pyi8/Py%. Furthermore, 
these two ratios can be averaged to- 
gether to furnish a single, more stable 
estimate of the generalization between 
Siand Sy. Since there are theoretical 
reasons for preferring the geometric 
to the arithmetic mean (25), the meas- 


ure of stimulus generalization might 
be defined by the formula 


Peering 
Py SP, 8 
PPS 


Owing to the gradual manner in 
which differential reinforcement takes 
effect, however, there is an tnitial 


phase during which the subject re- 
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sponds more or less randomly, This 
means that the function given in the 
above formula necessarily levels off at 
some positive asymptote for large 
interstimulus distances. Again, in 
order to compare data from different 
experiments, this asymptote must be 
brought down to zero for each experi- 
ment. This is accomplished by esti- 
mating a parameter C5 from each set 


of data (25), and by defining the gen- 
eralization between S; and S, to be 


GaS = (1 + CS) 
| PuSP,58 
K ee y] 
Pus PiE G al 
Likewise, in an experiment with 
highly discriminable stimuli, the re- 


sponse generalization between R; and 
R; will be given by 


Ga? = (1 + CP) 


Pat Pt =, 
x pap am 
y Pač Py" [2] 
where Pit is the conditional probabil- 


ity of Ry given the stimulus assigned 
to R; (25). 


Equations 1 and 2 then specify the 
dependent variables for paired-associ- 
ate experiments, i.e., stimulus and re- 
sponse generalization. With regard 
to the independent variables, i.e., 
interstimulus and interresponse dis- 
tances, it is clear that physical meas- 
ures are not directly applicable. 
Thus the psychological distance be- 
tween two tones at a fixed difference 
in intensity changes as both tones are 
increased in intensity, But this 
does not mean that there is no relation 
between physical and psychological 
distance. On the contrary, at least 
the following two statements can be 
made: (a) two tones which are brought 
arbitrarily close together in terms of 
physical measures also approach each 
other psychologically ; (b) as two tones 


Ez 
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separated by a fixed difference in 
intensity are increased in intensity, 
the psychological distance (although 
it does not remain fixed) nevertheless 
changes in a gradual and continuous 
manner. These considerations form 
a basis for the assumption that the 
locus of a set of stimuli in psychologi- 
cal space can always be obtained from 
their locus in physical space by a 
transformation that is (a) continuous 
and (0) differentiable (25). 

Although the stimuli and responses 
chosen for experiments on paired- 
associate learning have typically been 
words or nonsense syllables, nonverbal 
materials can be used just as well. 
Indeed, for the purposes of establish- 
ing a measure of the psychological 
distance between stimuli, it is espe- 
cially convenient to choose stimuli 
(such as tones differing only in in- 
tensity) that can be varied along a 
single physical dimension. In gen- 
eral, of course, stimuli that are evenly 
spaced along a single physical dimen- 
sion will be neither evenly spaced nor 
confined to a straight line in psy- 
chological space. Nevertheless, it fol- 
lows from the assumptions of contin- 
uity and differentiability that, if 
stimuli are evenly spaced over 4 
sufficiently small range of a single 
physical dimension, then they will be 
spaced approximately evenly along an 
approximately straight line in psy- 
chological space. 

Suppose, then, that the N stimuli of 
a paired-associate experiment satisly 
this special condition which, for pres 
ent purposes, will be called the linear- 
ity condition, Such stimuli can be 
designated as Si, Sz + - - Sy in such 
a way that the subscripts correspon 
to the ordinal positions of the N 
stimuli along the common (approxl- 
mately straight) line. The average 
generalization for all pairs of stimuli 
just D steps apart along this line will 


then be given by 


G(D) = ya Gi 


(withi-k =D) [3] 


where, as indicated, the summation is 
carried out over the N — D stimulus 
pairs separated by just Dsteps. Sim- 
ilarly, if the responses meet the linear- 
ity condition, this formula can be used 
as a measure of the average general- 
ization for all pairs of responses sep- 
arated by D steps. Thus, Ga in 
Formula 3 can stand for either Giz’ or 
Gi®. 

Now the average generalization 
G(D) and the separation D have 
properties that particularly recom- 
mend them for investigations of the 
relation between generalization and 
distance. First of all, D is quite in- 
sensitive to the physical measure used 
to space the stimuli evenly along the 
chosen dimension. ‘Thus, if the stim- 
uli are squares differing only in size, 
it does not much matter whether these 
squares are spaced in accordance with 
constant increments of area or length 
of side. It is a consequence o! 
linearity condition that these two 
variables (a? and a) will be almost 
linearly related within the small range 
of variation permitted along the size 
dimension. In fact, any variable that 
is equivalent to these variables except 
for a continuous, differentiable trans- 
formation could presumably be used. 
Thus the stimuli could just as well be 
regularly spaced in terms of measures 
based upon psychophysical procedures 
such as the summation of jn 's, cate- 
gory judgment, magnitude estimation, 
etc. (29). All of these measures are 
continuous and differentiable func- 
tions of the usual physical variables. 
This indifference of D to the vari- 
able chosen for evenly spacing the 
stimuli is further enhanced by com- 
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Fic. 2. An exponential decay function 
fitted to data from several experiments in- 
volving stimulus or response generalization 
during paired-associate learning. The center 
of each plotted symbol (triangle, square, 
circle) has average generalization G(D) as 
ordinate and distance D (multiplied by a con- 
stant factor) as abscissa. The experiments 
were conducted by Attneave (1), McGuire 
(18, 26), and Shepard (26). 
ment with circles of variable col 
were not confined to a single p 
sion. 


n obtained 
by Torgerson for these same colors (30, Ch. 
tained in a 
ogical space, 
assume that Torgerson’s 
judgmental method of triads yields satisfac- 


tory estimates of psychological distance in the 
present sense. In the experiment on response 
generalization, it had been found that the 
two end-responses did not conform to the 
linearity requirement (26). The plotted 
data were therefore based upon the generaliza- 
tion between the seven intermediate responses 
only, Finally, Attneave’s experiments devi- 
ated from the design Presupposed by the pres- 
ent analysis in the following respects. (a) 
The stimuli were quite widely spaced and so 
may not have met the linearity requirement. 
(0) The number of trial 
than in the other experiments. Thi 
the steady-state condition assumed in the en- 
suing theoretical discussion seem a little un- 
likely. (c) Since Attneave reported only the 
sums PiS + Pus but not the individual 


relatively small region of psychol 
it seems safe to 
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puting G(D) as an average for all 
pairs of stimuli separated by D steps. 
Thus, even if there is some residual 
systematic contraction or expansion 
of psychological distance, as one pro- 
ceeds from the first to the last pair of 
stimuli at a given separation D, this 
systematic effect will be largely can- 
celed out by averaging both ends of 
the stimulus range together. 

In Fig, 2 average generalization 
G(D) is plotted against D for several 
experiments on the learning of paired 
associates. In each experiment sey- 
eral different random assignments of 
the responses to the stimuli were em- 
ployed. Since the average psycho- 
logical spacing of the stimuli and re- 
sponses varies somewhat from experi- 
ment to experiment, the D-values for 
each experiment are multiplied by a 
constant factor to make them compar- 
able with the D-values for the other 
experiments. The Conclusion is clear: 


empirical Points fall 


an exponential decay 
function (the fitted curve), t 


hey devi- 
ate markedly from the alternative 
Functions A, B, and D of Fig. 1, 


DEDUCTION op THE EXPONENTIAL 
GRADIENT FROM A TRACE 
ODEL 
The principal pur 
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del to account for 


probabilities, it was necessary to substitute 
the approximate formula (Pas + Pi;8)/ 
(Pus + Pix) for the geometric mean used in 

However, in a previous investi- 


r statistical stability 
(d) There 
v to make the estima- 
tion of the constants CS feasible. For the 
data, therefore, CS 
That the agreement 
of data is good de- 
optimum conditions 
es suggests that the data 
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the empirically determined exponen- 
tial form of the gradient of general- 
ization in terms of a hypothetical 
trace process. The aim of such a 
proposal is twofold : First, it is desired 
to remove the apparent arbitrariness 
of the exponential gradient by showing 
that it follows from certain elementary 
assumptions of a more intuitively 
compelling character. Second, it is 
hoped that such a model will provide 
for prediction to other experiments in 
which the usual paired-associate con- 
ditions no longer prevail. 

E he model is suggested by recogni- 
tion experiments of the following kind. 
A subject is shown a certain square 
and, after a delay of ¢ units of time, 
shown a second square which may or 
may not be the same as the first. In 
this situation the probability of re- 
sponding “same” is distributed, with 
respect to the difference in size between 
the first and second squares, according 
to some bell-shaped density function. 
Since the time error is usually small, 
the mode falls near the zero-difference 
point. The variance of the distribu- 
tion, however, increases appreciably 
with the delay ¢ imposed between the 
first and second exposure (2, 3, 10, 17). 

The unidimensional stimuli, Si, S2 

. . . Sy, in an experiment on paired- 
associate learning, will be designated 
by small circles arranged in a vertica 
row along the left, as in A of Fig. 3. 
Corresponding to these, there are con- 
ceived the internal representations 
(perceptions), Si*, S2*, Sy", 
which will be designated by the small 
circles on the right. Whenever 4 
stimulus S; is presented, it leads to 
some one of the internal representa- 
tions S:*, with probability Pw- 
the responses are so distinctive as 
never to be confused, reinforcement 


e only if a stimulus is fol- 


will ensur 
presenta- 


lowed by its corresponding re 
tion (25). 
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Fic. 3. The diffusion and deconditioning 
of the stimulus trace. The arrows connecting 
a single stimulus S to the alternative percep- 
tual representations S* in A, B, and C illus- 
trate how the trace elements are assumed to 
spread out with time. In D, E, and F the 
process is represented as continuous. 


Suppose, then, that when S; leads to 
S;*, a large number of trace elements 
are conditioned (by reinforcement) 
from S; to S*. This bundle of trace 
elements or “stimulus trace” is des- 
ignated by the arrow in A. Immedi- 
ately following the removal of the ex- 
ternal stimulus S;, the trace elements 
are subject to haphazard perturba- 
tions so that some of the conditioned 
trace elements wander to adjacent 
stimulus representations as shown in 
B. (One might imagine here the 


action on the synapses of the random 


molecular processes associated with 
some of 


metabolism.) Later still, 
these elements will wander even fur- 
ther from S;*, as shown in C. 

Now for any two stimuli differing 
along a single physical dimension, 
another stimulus can be found which 
is situated intermediately between 
these. Thus, as indicated in D, a 
continuum may be conceived as under- 
lying the corresponding internal repre- 
sentations. This continuum then is 
the psychological space of the stimuli. 
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After removal of the stimulus, the dis- 
tribution of trace elements progres- 
sively spreads out in this Space, as 
illustrated in D, E, and F, 

_In addition to this spread or diffu- 
sion of the trace, it is assumed that the 
trace elements are also subject to 
Spontaneous deconditioning (again, 
Presumably owing to random proc- 
esses at the molecular level), Thus, 
as the elapsed time increases, the 
number of elements still conditioned 
decays to zero, In Fig. 3, the shaded 
areas represent the fraction of the 
original trace elements which are still 
conditioned at each time. 


The Deconditioning of the Trace 


The probability that a given condi- 
tioned element will suffer decondi- 
tioning during a small interval Aż will 
be denoted by U(Al). The simplest 
rule that can be assumed to govern 
this probability is as follows: 


logical space) to which the element has 
drifted in that time. 


The probability that an element, 


still conditioned at time 4, remains 


t+ At is, of course, 
Ls U(At). Therefore, the probabil- 


venient, therefore, to define 
a deconditioning Parameter, 


en 
Pas [5] 
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which does not depend upon the choice 


of an arbitrary interval At. Then, if 
the axis is translated so that condi- 


tioning takes place at £ = 0, m can be 
increased in such a way that, as At > 
0, m-At— ¢. 

Although the probability that a 


given trace element is deconditioned at 
Precisely time ¢ is zero, the probability 


Per unit time (the “probability den- 


sity”) or rate of deconditioning for an 
individual trace element is, at the 


particular instant £, 
lim [1 — U(A) JU (44)/At 
Ato 


But by Equation 5 as Aż 0, UAD 
U- At — Ut/m. Therefore, the rate 
of deconditioning at time ¢ is, for 
single trace elements, j 


lim [1 — Ut/m y 


= Uexp(— UN [6] 
where, for convenience jn wh 
s at fol- 
lows, exp E Ut) 1S used in place of 
na 


Assumption IT $ 
further reinforceme 


t 
U(t) =1 -Í U exp (— Ur)dr 
= exp (~ Ui [7] 
The Diffusion of the Trace 


_ Equation 7 completes the quantita- 
tive formulation of the deconditioning 
of the trace. In order to provide a 
similar formulation for the diffusion 
of the trace, it is necessary to consider 
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the motions of the individual trace 
elements in psychological space, ‘The 
exposition is simplified by continuing 
to suppose that the stimuli are evenly 
spaced along a restricted range of a 
single physical dimension. It is then 
possible to introduce one coordinate 
x; for each stimulus representation S# 
giving its position along a one-dimen- 
sional psychological space. The psy- 
chological distance between any two 
stimuli, S; and Sx, is then 


Da = |æ: -e £l [8] 


Now the expression Vix(Ax, At) 
will denote the probability that, if at 
time ża trace element is situated at x:, 
by time ż -+ At it will have moved into 
the one-dimensional region bounded 
by xz and x + Ax. As before, the 
arbitrary interval Ax can be eliminated 


by defining a new quantity 


Yate) = ia ESA 
Aro Ax 

Va (At) is the probability density 
of displacements from x; to x, during 
the brief interval Az. The simplest 
rule that can reasonably be assume 
to govern this quantity is as follows: 

Assumption III. Va,(At) is an 
invariant function of the psychologi- 
cal distance between S; and Sk It 
does not depend upon the time ¢ or 
upon the absolute position of the pair, 
S; and S;, in psychological space. 

If the x-axis is translated so that x; 
= 0, it follows from Assumption III 
that, for a given interval Aż, there 
exists a fixed function fan such that 


Val At) = fa(Da) = faelxel [10] 


However, it will not be necessary to 
make any particular assumption about 
the form of the function far. It 1s 
only necessary to insure that, during 
a short period of time, the probability 
of a very large displacement is negli- 


gible. This can be stated with greater 
precision as follows: 

Assumption IV. The function fa: 
and, thus, the probability density of 
displacements of a trace element from 
x = 0 is distributed over the x-axis 
with finite variance. 

From Assumptions III and IV it 
follows that the variance of the dis- 
tribution of displacements during an 
interval At is the finite constant 


V(At) = f +e eVa Ade [11] 
-0 


Since V (Aż) must depend upon the 
length of the interval At, it is useful to 
define a diffusion parameter, 


. V(At) 
W= flin aa [12] 
At At 


which does not require the stipulation 
of an interval At. Just as U governs 
the rate of deconditioning of the trace, 
then, V controls the rate of spread or 
diffusion of the trace in psychological 
space. 

The next question to be answered is 
this : Given the diffusion parameter V, 
what form will the distribution of 
trace elements take after an appreci- 
able delay ¢? It is possible to show 
that the assumptions which have been 
made are sufficient conditions for the 
desired distribution to tend toward a 
limiting form which is independent of 
the form of fa: (6, 15). Specifically, 
if a trace comprising a large number 
(n) of elements is conditioned from S; 
to x; at ¢ = 0, the density of these 
elements at x, for some later time ¢ 
conforms with the Gaussian function 


n: Valt) = n: (2r Vi) 
X exp [— (Dx)?/2Vt] [13] 


The beauty of this result is its com- 
plete independence from the under- 
lying mechanism symbolized by fat. 
Thus, even though one imagines, for 
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example, that thermal agitation of the 
molecular substrate is responsible for 
deconditioning and diffusion, the bio- 
physical details of this Process need 
not be specified. For, according to 
the present formulation, these details 
are irrelevant to the question of the 
gross behavior of the trace system. 


The Trace Process in Paired-A ssociate 
Learning 


Now in the course of learning paired 
associates, each stimulus S; will have 
been presented on many occasions. 
Furthermore, on a number of these 
occasions, reinforcement of the re- 
sponse assigned to S; will have condi- 
tioned a bundle of x trace elements to 
S;*. Thus at some given time fo, the 
density of conditioned elements re- 
sulting from the immediately preced- 
ing reinforcement will be distributed 
in psychological space as shown for 
tı in Fig. 4, Likewise, the densities 
‘remaining from earlier reinforcements 
will be distributed as shown for ta, 
t_s, and so on. 

_ The total distribution of condi- 
tioned elements emanating from Sat 
to can be found by summing the 
Gaussian distributions resulting from 
all previous reinforcements of the re- 
Sponse assigned toS, Itis possible to 
derive an analytic approximation to 
this composite distribution (J: in 

by Supposing that, after an 
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Fic. 4. A series of Gaussian 

with increasing variances, 

distribution arising from 
these over time, 
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initial phase of rapid learning, rein- 
forcements occur at relatively fre- 
quent and regular intervals. If the 
early trials are disregarded, then, a 
roughly steady-state Process can be 
considered. The summation of the 
Gaussian curves of various ages can 
then be approximated by an integra- 
tion of these curves over £. 

Now, with respect to S;, the density 
of trace elements at X; resulting from 
a reinforcement { time units ago is 
given by n: Viz(t). 
the fraction U(t) of these is still con- 
ditioned. Therefore 
elements at x, which remain condi- 
tioned after a delay ¢ from reinforce- 
ment is 2- U(t)- Vin (t). 


cord with 
f n: U(t): Va (t)dt 


= S e (- Ut): (2r Vi) 
‘exp [— (Di,)2/2 Ve jdt [14] 


Fortunately, 
effected (4, p, 


n(2UV)-3 
exp [~ Da(2U/V)] [15] 


For intermediate 


the integration can be 


144) and, indeed, yields 


continuum, It is 
density, however, 
U <1) the inte; 
tion over the 
greater than 
Multiplication 

/n converts th 


not a probability 
Since (if n> 1 or 
ration of this func- 
x-axis yields a value 
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ability density, 


Pa = (U/2V) 
-exp [— Du(2U/V)#] [16] 


for then 


Now Equation 16 furnishes an esti- 
mate of the conditional probability 
(per unit x) of the particular percep- 
tion xz, given the stimulus S. In 
order to secure the probability of 
taking S; to be S; (through stimulus 
generalization), S,* can be reinterpre- 
ted as a finite region partitioned off 
from psychological space in the neigh- 
borhood of x. In this way the entire 
one-dimensional space can be divided 
into N mutually exclusive and ex- 
haustive segments so that each seg- 
ment corresponds to one of the ex- 
ternal stimuli. A given trace element 
is then said to be conditioned to S,* at 
time żif and only if it falls in the region 
containing x; at that time. 

Px, reinterpreted in this way, can 
be taken as an approximation to the 
conditional probability that the ex- 
ternal stimulus S; will lead to the 
internal representation S,*. If the 
early trials are ignored (since these 
trials do not sufficiently approach a 
steady state), the constant C% in 
Equation 1 can be disregarded (25). 
Setting CS = 0, and substituting the 
right-hand member of Equation 1 
for Pas in Equation 1, then, the 
generalization between S; and Sx as- 
sumes the remarkably simple form 


GaS = exp (— Diz vV2U/V) [18] 


Letting « stand for the constant 
¥2U/V and averaging GaS over all 
pairs of stimuli separated by a fixed 
distance D, Equation 3 now takes the 


[19] 


form 
G(D) = e? 
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But if x is identified with the constant 
distance multiplier calculated for each 
experiment, this is precisely the ex- 
ponential function fitted to the empir- 
ical data in Fig. 2. 


Further Aspects of the Trace Process 


The role of deconditioning. It might 
have seemed unnecessary to include 
the deconditioning assumptions (I and 
IJ) along with the diffusion assump- 
tions (III and IV), since the process of 
diffusion alone would account for the 
spread of the trace, and hence for 
stimulus generalization. However, 
from Equation 18 it is clear that, with- 
out deconditioning, generalization 
would be so extensive as to prohibit 
the learning of paired associates. 
For if U = 0, Gy = 1 for all t and k. 
In this case the gradient is perfectly 
flat, and discrimination between stim- 
uli is impossible. This is a conse- 
quence of carrying the integration of 
Equation 14 out to infinite ¢. Alter- 
natively, one could integrate only out 
to some finite valuet = T. However, 
in terms of the model, this is equiva- 
lent to assuming that the independent 
elements simultaneously suffer de- 
conditioning at the same instant T 
time units from conditioning. Ra- 
ther than postulate coincidences of 
this kind, it seems more plausible to 
assume the gradual kind of fadingaway 
implied by Assumptions I and II. 
This fading away (or forgetting) then 
serves the adaptive function of weight- 
ing old, diffuse traces less heavily than 
new, accurate traces. 

Of course, the integration of Equa- 
tion 14 out to infinite ¢ is not strictly 
justified, since the learning experi- 
ment itself proceeds for only a finite 
period. The error introduced by the 
infinite integration is small, however, 
if the deconditioning parameter U is 
sufficiently greater than zero. For 
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then the hypothetical traces persist- 
ing from reinforced trials that might 
be imagined as preceding the actual 
beginning of the experiment would be 
almost totally deconditioned during 
all but the early trials of the learning 
experiment. 

Asymmetric generalization. It has 
sometimes been suggested that there 
exist asymmetries in which a general- 
ization going in one direction, e.g., 
Si — S,*, is more probable than a gen- 
eralization going in the reverse direc- 
tion, S > S. At first glance, such 
a possibility appears to violate As- 
sumption IH, acéarding to which the 
probability of a displacement for a 
given trace element depends only 
upon the length (and not upon the 
direction) of that displacement, 
However, an account of such asym- 
metries which is consistent with As- 
sumption III is suggested by the 
analysis proposed by Bush and Mos 
teller. Aevarding to thee model, the 
psychological dissimilarity going from 
S: to S, is equal to the dissimilarity 
going from S, to S: only if the “Set 

comprising S; 
a as the set of 
stimulus elements comprising S, (5). 
In terms of the trace model, then, it 
can be supposed that the area of the 


space des- 


the region 
Since the prob- 


m) will occupy a 
igan Interval At ig 
) the area of that 
region, Pa” does not necessarily equal 
Pys, Indeed, the weight WS, pro- 
posed earlier by Shepard (25), is 
Presumably a measure sf the area of 
the region corresponding to 9, This 
interpretation Suggests why the emi- 
pirically obtained weights tended to 
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be greatest for stimuli or responses at 
the end of a linear array (26). 
Multidimensional generalization. 
For expository reasons the stimuli 
were supposed to vary along a re- 
stricted range of a single physical di- 
mension. Actually, by making use 
of the theory of random motions in 
Euclidean spaces of more than one 
dimension (6), it can be shown that 
the trace model leads to the same ex- 
ponential gradient in either of the 
two following multidimensional cases: 
(a) the psychological space is Euclid- 
ean; (b) the stimuli are confined to a 
small region of psychological space, 
(The second case follows from the 
first. For, by the hypothesized rela- 
ical and Psychologi- 
cal space, a sufficiently small region 


Euclidean.) ‘The 
treatment of generalization aver large 
distances iq non-Euclidean spaces, 
however, a 


trace model 


1 the case of 
response generalization, Suppose, for 


example, that the Tesponses are closely 
spaced along a single physical dimen- 
sion, whereas the stimuli are com- 
pletely discriminable. When one re- 
sponse R; is intended, some other 
response R, may actually be made, 
Using a notation analogous to that 
adopted for stimulus generalization, 
it can be said that R; leads to R, with 
probability Pa. If then R;* leads 
te Ra the ensuing Feinlorcenient 
conditions a Jarge number of trace 
elements from R* to a point æ; in the 
psychological Space of the responses. 
These conditioned trace elements are 


be 
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then subject to the rules already set 
forth in Assumptions I through IV. 


Tur Form OF THE GRADIENT AND 
THE SCHEDULE OF REINFORCEMENT 


In the last section the apparent 
arbitrariness of the specific function 
fitted to the paired-associate data of 
Fig. 2 was reduced by showing that 
this function is a consequence of 
elementary assumptions which do not 
involve the specification of any partic- 
ular function. The purpose of this 
section is to determine whether these 
game assumptions have any conse- 
quences for diferent types of 2Xpen» 
ments in which the sequence of rein- 
forced and nonreinforced trials is 
manipulated, as in the study by Hum- 
phreys (13). 

Certainly, from Fig. 4 it is clear 
that the sharp peak of the combined 
gradient at *—9 is contributed 
solely by the distributions of trace 
elements resulting from the most re- 
cent reinforcements (like the dis- 
tribution £1). Therefore, if feedback 
as to the correctness of each response 
is terminated, the composite gradient 
should become rounded (convex Up- 
ward) in the vicinity of « = 0 ant 
under continued extinction, gradually 
flatten out. Likewise, if reinforce- 
ment is delivered only for very nth 
correct response, the composite grati- 
ent should assume an intermediate 
form which tends more toward either 


an exponential or a bell-shaped curve 


as n is made smaller or larger- on- 
siderations of this kind may account 
that, 


for the finding of Humphreys 
whereas with 100 per cent reinforce- 
ment the generalization gradient was 
concave upward, with 50 per cent 
peliloreement it was initially convex 
upward (33)... ee 

Tt is also feasible to analyze data 


from paired-associate experiments 
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Fic. 5. The gradient of generalization as 
a function of the number of trials intervening 
between a response and the last preceding 
feedback as to the correctness of that response. 
The numbers of intervening trials are grouped 
together as follows: I. 0 trials; Il, 1-3 trials; 
III. 4-8 trials; IV. 9-15 trials; and V. 16 or 


mare trials. 


with 100 per cent reinforcement for 
this effect. Since the stimuli are pre- 
sented in random order, the delay be- 
tween successive occurrences of a 
given stimulus or response varies over 
a considerable range. Thus separate 
gradients can be plotted depending on 
the number of trials intervening be- 
tween a given stimulus-response se- 
e and the most recent feedback 
concerning the correctness of that 
particular sequence. The most €x- 
tensive data currently available in a 
form suitable for this analysis come 
from an experiment on response gen- 
eralization in paired-associate learning 
(26). Those data are therefore re- 
analyzed and plotted in Fig. 5. As 
redicted from the model, the gradient 
of generalization systematically 
changes from concave to convex up- 
ward in the vieinity of the correct 
se ag the number ol trials inter- 
yening between a response an è 


quence 
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last differential reinforcement of that 
response is increased. l 
These results may help to explain 
why the so-called “discriminal dis- 
persion” observed in absolute-judg- 
ment and identification experiments 
seems to conform to a Gaussian or 
normal density function (9, pp. 317- 
319). Since differential reinforce- 
ments are not usually provided in 
these experiments, the discriminal dis- 
persion is presumably maintained by 
those haphazard reinforcements of 
everyday existence which antedate 
the beginning of the experiment 
proper. Under these circumstances 
the gradient must be relatively meso- 
kurtic, as illustrated in V of Fig. 5. 
This gradient resembles a Gaussian 
function, and is quite unlike the com- 
paratively leptokurtic gradients (such 
as I) which have been shaped by re- 
cent differential reinforcement., 


MICROMECHANICAL AND MACROME- 
CHANICAL MODELS FOR GENERAL- 
IZATION 


The present trace model might be 
termed a micromechanical model be- 
cause it is based upon assumptions 
about the fine-grain or “microscopic” 
mechanics of the underlying trace 
Process. This model is to be dis- 
rlier macrome- 
from assump- 
tions pertaining to large-scale or 

“macroscopic” aspects of generaliza- 
tion (25). However, to the extent 
that the two models are compatible, 
they can be integrated so that the 
macromechanical assumptions of the 
earlier model wil] appear as deductions 
from the more primitive microme- 
chanical assumptions of the present 
model. The aim of such an integra- 
tion would be to broaden the scope of 
the earlier model. F or, whereas the 
macromechanical model applies only 
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to paired-associate experiments with 
continual differentia] reinforcement, 
the micromechanical model has im- 
plications for a considerably wider 
range of experiments. 

The three basic assumptions from 
which the macromechanical model was 
originally derived are as follows. (a) 
Stimulus and response generalizations 
take place independently of each 
other. (5) The probability of a 
stimulus generalization is an expo- 
nential decay function of the psycho- 
logical distance between the stimuli, 
(c) The probability of a response gen- 
eralization is an exponential decay 
function of the psychological distance 
between the responses. Now to say 


is to 
anical as- 


1 ns c) will retain 
their original form only under cer- 


tain limiting conditions (e.g., under 
reinforcement), When 
these conditions are modified (as 
when the reinforcements are delivered 
only intermittently), Propositions } 
and c will h 


forms in accordance with the con- 


al reinforcement, 
paired-associate 
macromechani- 
cal assumptions would have a different 
the microme- 


Specifically, if 


small, the trace model does 
directly to the exponential gradient. 
Because occasional Sequences of the 
form Sim eee R:* + R; will be 
reinforced, trace elements will be con- 
ditioned from S; to x, (rather than 
only to x;). Events of this kind will 
somewhat alter the form of the gradi- 
ent. (In a previous experiment with 
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generalization, both between the stim- 
uli and between the responses, no 
significant departure from prediction 
on the assumption of an exponential 
gradient was observed [26]. How- 
ever, the theoretically expected devi- 
ations would be quite small and may 
have been obscured by the rather 
large variability of the data from that 


experiment.) 
Detailed derivations from the mi- 


cromechanical assumptions are quite. 


complicated in the case of simultane- 
ous stimulus and response generaliza- 
tion, owing to the circumstance that, 
in this case, the form of the gradient 
depends upon the particular stimulus- 
response assignment enforced. Part 
of this complication is connected with 
the absence in both models of any 
account of the decrease in generaliza- 
tion which necessarily accompanies 
learning. An entirely satisfactory 
treatment of these problems will 
probably require an even more basic 
integration of these models with the 
already extensively developed models 
for learning per se. 


SUMMARY 


The problem of the relation between 
generalization and dissimilarity (iéa 
the problem of the shape of the 
“gradient of generalization”) is, 16 
examined in the light of recent the- 
oretical and empirical developments. 
With regard to experimental arrange- 
ments in which reinforcements are de- 
livered in accordance with a one-to- 
one assignment of the responses to the 
stimuli (as in paired-associate learn- 
ing), the following conclusions are 
drawn: 

1. Measures of generalization can 
be defined in terms of the conditiona 
probabilities with which the various 
stimuli lead to the various responses. 

2. Thus defined, stimulus general- 
ization and response generalization are 
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both invariant functions of inter- 
stimulus and interresponse dissimilari- 
ties, respectively, provided that two 
conditions are met. First, dissimi- 
larity is reinterpreted to mean a 
“psychological distance” which (a) 
is equivalent to “physical distance” 
except for a continuous, differentiable 
transformation, and (b) satisfies the 
metric axioms. Second, a given 
schedule of reinforcement is main- 
tained. 

3. Under conditions of frequent 
and regular reinforcement (as in the 
typical paired-associate experiment), 
the gradient of generalization is 
closely approximated by an exponen- 
tial decay function (concave upward). 

4. Under conditions of infrequent 
or intermittent reinforcement, this 
gradient departs from the exponential 
function in that it is convex upward in 
the vicinity of the reinforced stimulus 
or response. 

5. The empirically observed gradi- 
ents of generalization can be deduced 
from a mathematical model based upon 
four elementary assumptions COn- 


‘cerning the temporal decay of stimu- 
- lus and response traces. 
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THE EFFECTS OF EARLY HANDLING ON VIABILITY 
OF THE ALBINO RAT? 


EVERETT W. BOVARD 


Montreal Neurological Institute 


This paper concerns effects of early 
handling on viability of the albino rat. 
The relation of early experience, par- 
ticularly handling, to the response to 
stress will be considered first; then neu- 
rophysiological mechanisms mediating 
the observed effects of handling on via- 
bility of the rat will be explored. 

Experiments by Selye (60, 61) in 
1936 showed that a wide variety of 
noxious agents and stimuli—e.g., poi- 
sons, ionizing. radiation, extreme cold, 


excessive exercise, surgical injury, and 


emotional excitement—produce a non- 
specific, systemic response in the verte- 
brate organism. This adaptive response 
involves release of adrenocorticotrophic 
hormone (ACTH) from the anterior 
pituitary, and consequent release of ad- 
renal cortical hormones, such as corti- 


1This material is based, in part, on a re- 
port to the National Research Council of Can- 
ada, Jan. 3, 1953, and on a paper subsequently 
presented to the Canadian Psychological As- 
sociation, May 28, 1953, at Kingston, Ontario. 
Related research at the University of Toronto 
was made possible by Grant A.P. 20 from the 
National Research Council of Canada to the 
writer, 1952-1954, and by the friendly and 
continued support of E. A. Bott, Head, De- 
partment of Psychology; University of To- 
ronto. Preparation of this paper was made 
possible by a grant from the Research in Hu- 
manities Fund of the University of Toronto. 
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sone and 17-hydroxycorticosterone, into 
the blood stream. ‘The general effect 
of these adrenocortical hormones is to 
maintain blood pressure and blood-sugar 
levels, thus counteracting shock effects 
in injury and trauma. These hormones 
stimulate metabolism at the cellular 
level? and, in general, their end-effects 
are protein catabolic, involving break- 


‘down of protein into glucose. 


There appears to be general agree- 
ment in 1958 that almost any type of 
disturbing stimulus induces a pituitary- 
adrenal “response to stress in the rat, 
and that this reaction is essential to pre- 
serving the animal’s metabolic and cir- 
culatory integrity under stress, as Engel 
stated in 1953 (14). 

The work of Selye (60) at Montreal 
has indicated that prolonged stress, like 
cold or excessive exercise, will lead to 
degenerative renal and cardiovascular 
changes in the rat and, if sufficiently 
prolonged, to the subsequent death of 
the animal. A number of studies in the 


2 A stress not encountered in nature is oxy- 
high pressure. Under this stress, pres- 
f the adrenals is detrimental to survival 
These and similar experi- 
ments (3, 16) suggest that the normal action 
of the adrenal cortex is to enhance metabolic 
processes, particularly oxidation, at the cellu- 


lar level. 


gen at 
ence 0 
of the rat (17). 
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United States have, however, suggested 
that there are marked effects of early 
individual attention, including handling 
and petting, on growth and survival of 
the albino rat. 

Hammett (23) in 1922 found that 
rats stroked or petted were much more 
likely to survive an operation for re- 
moval of the parathyroid gland than 
those that had not been handled. Bern- 
stein (4) reported in 1952 that handled 
rats gained more weight than unhandled 
controls, and also learned mazes faster 
and with fewer errors. 

These and other considerations sug- 
gested the hypothesis that early-handled 
tats would be better able to withstand 
severe stress than nonhandled controls. 
This hypothesis was tested in a series 
of experiments at the University of To- 
ronto under the writer’s direction. 

Handling and viability. This research 
has shown that early handling of the 
albino rat increases viability under stress 
as an adult (7). Weininger, in two in- 
dependent experiments (68, 69), found 

that early-handled rats, subjected to 48 
hours of immobilization and food and 
water deprivation as adults, sustained 
less heart and gastrointestinal damage 
than unhandled controls. Long (7) 
found that early-handled rats survived 
complete food and water deprivation as 
adults 53.8 hours longer than matched 
controls, thus Confirming the results of 
an earlier pilot study (67) by Wein- 
inger. In general, early-handled ani- 
mals showed Jess emotional reactivity to 
a strange environment, and their adre- 
nals did not become hyperreactive un- 
der stress (69), These results suggest 
that early handling raises the threshold 
for the pituitary-adrenal response to 
stress. 

In the Toronto studies, early h; 
involved holding the rat loosely in the 
left hand against the experimenter’s 
chest and stroking the rat “with the 


grain,” head to tail, for 10 minutes 


andling 


a 


day, a slight modification of the method 
originally developed by Bernstein (5). 
For convenience through the remainder 
of this paper, this general method will 
be referred to as “Type I” handling. In 
the Toronto studies, handling was car- 
ried on for 21 days after weaning (age 
21 days). 

These findings have been in part con- 
firmed by subsequent experimental work 
in the United States. Levine and Otis 
(36) at Ohio State found that rats han- 
dled prior to weaning showed signifi- 
cantly less mortality following food and 
water deprivation than nonhandled rats 
or rats handled after weaning. Levine 
(32) has further shown that early-han- 
dled mice have a higher threshold for 
emotional reactivity than nonhandled 
controls. Levine has also found re- 


cently (34) that adrenal weights of rats 
handled prior to we: 


cantly lower, 
glucose injectio 


stress between 
trol rats. 

In these studi 
colleagues, howe 
volved removin 


es of Levine and his 


xample, 
and-returning procedure 
Probably involved no more than a 


minute all told, and probably no more 
than 12 seconds’ contact with the ex- 


perimenter.! Yet this abbreviated pro- 


“In a study at Toronto, Newton (7) found 
that prolonged (postweaning) experience with 
the mother rat increased the infant rat’s emo- 
tional reactivity as an adult, compared to han- 
dled rats, and that this extra mothering low- 
ered the basal metabolic rate and weight for 
the rat, compared to controls that had not 
been either extra-mothered or handled. 


* Personal communication with S. Levine, 
1957, 
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cedure was found to have the same pro- 
tective effects under stress * as the To- 
ronto method, also used by Levine and 
Otis in their study (36). For the sake 
of convenience, the kind of handling 
used by Levine will be referred to as 
“Type II” handling in this paper. It 
should be noted that in the Levine 
studies “early handling” refers in gen- 
eral to handling before weaning. 

In a replication of Weininger’s experi- 
ments (68, 69), but using group-housed 
rather than individually-caged rats as in 
the Toronto studies, Stern and Taylor ° 
have recently found significantly reduced 
gastrointestinal lesions in early-handled 
(Type I) rats following 48 hours of im- 
mobilization, compared to controls, but 
no differences in adrenal weight, heart 
damage, or emotional reactivity under 
these conditions. 

An experiment by Scott (59), con- 
trary to the findings of Weininger and 
Levine, showed no differences in weight, 
emotional reactivity, or survival under 
severe physical stress (massive doses of 
thiourea) between early-handled (Type 
I) and control rats. In this experiment, 
however, a severe stress in the form of 
a heat wave intervened between the 
early handling and the experimental 
tests. The effects of such intervention 
of stress on the growth gains usually in- 
duced by early handling are discussed 
below. 

A number of recent studies have sug- 
gested that sensory input from other 
modalities may influence the effects of 
handling on viability, An experiment 


5 In his experiment at Toronto, mentioned 
above, Long (7) found that rats that were 
merely picked up and held against the experi- 
menter’s chest for 30 seconds, and then re- 
turned to their cages, survived longer under 
total food and water deprivation than con- 
trols, but this difference was not statistically 
significant. 

© Unpublished 
Taylor, School © 
versity, 1957. 


1 study by J. A. Stern and R. 
f Medicine, Washington Uni- 


by King’ has shown that even visual 
contact with others of its species has 
survival value for the guinea pig in in- 
fancy, when the animal is already un- 
der dietary stress (vitamin deprivation). 
An experiment by Shelly, Pasamanick, 
and Levine (62) showed that rats held 
loosely in a black box a few minutes 
a day for nearly three weeks evidenced 
only transient growth gains over con- 
trols during the next week, irrespective 
of their age at time of handling. Rats 
handled prior to weaning in this experi- 
ment also showed a decrease in timidity 
in a strange situation that disappeared 
after one trial. Macdonald and Teght- 
soonian (38) found that three minutes 
of intense light stimulation produced a 
transient but significant weight gain in 
rats that had previously received maxi- 
mum visual stimulation in infancy, com- 
pared to controls that also experienced 
the intense light as adults but that had 
been subjected to less visual stimulation 
in infancy. 

These results suggest, as Thompson 
has hypothesized (66), that the total 
quantity of sensory interaction with the 
environment, rather than the specific na- 
ture of the sensory input, is the essential 
factor involved in effects of early ex- 
perience on viability. If this hypothesis 
were correct, then the effects of early 
handling on viability would depend on 
the amount of visual stimulation con- 
comitantly experienced by the young 
animal. It might be expected, in this 
case, that rats handled in the dark 
would be less resistant to stress as 
adults than those handled in daylight 
or under artificial light. 


Basic HYPOTHESES 
Stress threshold, ‘The key hypothesis 
of the position presented here is that 
early handling raises the threshold for 
response to stress, thus conserving the 


7 Personal communication with J. A. King, 


1957. 
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metabolic resources or adaptation en- 
ergy (60) of the Organism.® The re- 
sponse to stress may be differentiated 
into two reactions. Stimulation of the 
adrenal medulla by the sympathetic di- 
vision of the autonomic nervous system 
results in release of epinephrine and 
Nor-epinephrine into the blood stream. 
And, as noted above, stimulating of the 
adrenal cortex by ACTH from the an- 
terior Pituitary results in release of 
adrenocortical hormones, such as corti- 
sone and 17-hydroxycorticosterone, into 
the blood stream, Both these reactions 
are, in general, mediated by the hypo- 
thalamus (25) and they may be con- 
sidered, as Cleghorn (9) suggests, to be 
the two arms of a single response to 


8 Conversely, this suggests that an animal 
with a low 


leakage of its Teserves (which can be conceived 


its capacity to deal with the Major si 
later life, That is to say, 
resistance to later, 
ing of Jones 
the di 


Thus, either 
ance to later, as is hypothesized her 
Stress enhances resistance to later 
Levine has suggested. This Point 
experimentally decidable, using suc 
indices of resistance to stress as the total life- 
Span of the animal. 
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noxious stimuli. It can further be sug- 
gested, with Sayers (55), that both 
arms of the response to stress usually 
occur together, 


oid (17-hydroxycorticosteroid) blood- 
plasma content of early-handled and of 
Stress, after the 
Samuels (44). 
comparison of 
resting 17-hydroxycorticosteroid levels in 
early-handled and in control animals, as 
a further check on the hypothesis of re- 
duced Pituitary-adrenal activity in the 
former. A third, and critical, experi- 
ment would involve Measurement of 
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ACTH blood content in early-handled 
and unhandled adrenalectomized ani- 
mals under stress, using the method of 
Sydnor and Sayers (65). The animal 
of choice for these experiments might 
be the guinea pig, since the resting 
level of 17-hydroxycorticosteroid blood 
plasma content in the rat is so low as to 
be virtually undetectable (12). 


Even if it were further confirmed, 
however, through the experiments sug- 
gested above, that early handling re- 
duces pituitary-adrenal output, this 
would by no means demonstrate that 
such reduction was due to a dampening 
of pituitary-adrenal activity. It could 
equally well be due to a greater capacity 
of the pituitary-adrenal axis in early- 
handled animals for the normal produc- 
tion of adrenal cortical steroids, as Le- 
vine has suggested (35), so that for this 
reason there would be less detectable 
output under stress. 

Levine (35) has recently shown that 
handling (Type II) after birth induces 
significant increases in ascorbic acid de- 
pletion (an index of stress response) in 
rat adrenals under stress, and on the 
basis of this and other work concludes 
that handling accelerates maturation of 
the pituitary-adrenal system in the rat. 
In this connection, Woods (72) has 
shown that the wild Norway rat, with 
markedly greater adrenal capacity than 
the laboratory rat, shows no detectable 
adrenal ascorbic-acid depletion as an 
adult under such stress as low tempera- 
ture, fighting, or loud noise. In this 
case, then, the greater capacity of the 
wild rat results in a reduced pituitary- 
adrenal output under stress, compared 
to the laboratory animal. 

Tf the hypothesis that early-handled 
rats had, as a result of handling, greater 
adrenal capacity than unhandled con- 
trols, were correct, several experiment- 
ally testable hypotheses should follow. 

First, this greater capacity should be 
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evidenced in greater weight of the ad- 
renals under normal conditions, since the 
greater capacity of the wild rat is re- 
flected in adrenals proportionately two 
to four times the size of those of labora- 
tory rats, the difference being in size of 
the adrenal cortex (50). No differences 
between adrenal weights of early-han- 
dled (Type I) and unhandled rats have, 
however, been detected prior to stress 
(69). 

Second, this presumed greater capacity 
and greater normal output should be re- 
flected in a higher adrenal-steroid (17- 
hydroxycorticosteroid) resting level in 
handled, compared to unhandled ani- 
mals. But this would be precisely the 
reverse of the expectation if the posi- 
tion advanced in this paper were true, 
for if anterior pituitary and adreno- 
cortical activity were permanently damp- 
ened under early handling, then the rest- 
ing 17-hydroxycorticosteroid level should 
be lower than that in unhandled con- 
trols. 

The question thus resolves itself into 
whether the obtained reduction in pi- 
tuitary-adrenal output under stress of 
early-handled animals, as found by 
Weininger (69) and Levine (34), is 
due to a concomitant reduction in stimu- 
lation of the anterior pituitary and hence 
in ACTH output, as suggested in this 
paper, or is due to accelerated matura- 
tion and consequently greater capacity 
of the pituitary-adrenal system in early- 
handled animals, as suggested by the 
work of Levine (35). 

A reduction of sympathetic-adrenal 
medulla response in early-handled ani- 
mals under stress is suggested by the 
reduced emotional reactivity that has 
usually, but not always, been found in 
these animals as adults. 

Weininger (69) found that early-han- 
dled (Type I) rats, introduced to a cir- 
cular open field enclosed by three-foot 
metal walls with a 200-watt bulb hung 
over the center, ventured further to- 
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wards the center of this arrangement 
than controls (two experiments), and 
showed more activity than controls (two 
of three experiments, the result being 
reversed in the third). Levine (32) 
found that early-handled (Type 11) 
mice showed less latency in leaving 
their cages for food and in fighting 
other mice than controls. Bernstein 
(5) found that rais handled (Type 1) 
prior to and duel the course of the 
experiment showed more vigorous ex- 
Ploratory behavior in a maze and in 
thele hómé cages than did controls, 
learning mazes faster and with fewer 
piap Hittites fl BARA FPREHYIFY at 
andied rats cou account for both 
thete superior maze performance and 
5reater i Ploratmry activity, 

n a recent experiment, Rosen (51) 
found early-handled (Type I) rats 
dominant in paired food competition 
with unhandled controls, and suggests 
reduced emotional reactivity of handled 
rats as the relevant factor, No cor 


relation was found between weight and 
dominance, 


_ Of the other hand, Oana 30) a 
in a reeen study iat early handlin 
(Type I) of the albino rat had no 
effect on adult emotional hehavior, as 
measured by frequency of defecation, 
number of bo Uses excreted, and activity 
me Bpen feid i 

he baaie Ny pode of teleod sym- 
pathetic-adrenal medulla response of 


eafly-handied animals Under stress cut 


be tested by Cotipartng iac rate ara 
peripheral Sympathetic reaction (blood 
Pressure in the tail, fo 


ae | r example) of 
eatly-handled and control st Be 
stress. Tt could further be tested by 
comparing blood and urine output of 
epinephrine from the adrenal medulla of 
early-handled and control rats subjected 
to noxious stimuli. 

Although a number of interesting 
elean:up studies remain to be done in 


this area, if is the conviction af thë 


writer that the point of diminishing re- 
turns has been reached in current psy- 
chological studies of the effects of the 
early experience of handling on the re- 
sponse to stress, and that the time is 
now ripe to enquire into the underlying 
neurophysiological mechanisms operat- 
ing to produce the observed results, 


Hvrornarsmro Rnaoriyity 


If the effects of early experience of 


handling on viability are to be accounted 


for in terms Of the threshold for streg, 
t 


n the effects of E experience on 
the nevrophysiological determinate of 
No atred TESDONSE MUEL Do dhaten. Mhie 
writer (6) has suggested that early han- 
dling dampens posterior hy pothalanic. 
Feaclivity to sires, Portex (45, 46) 
bar shown that fhis a the area (mam- 
millary bo y and mec an eminence in- 
volved lü the pituitary-adrenal => 


to stress. Such dampening of the pos- 


terior hypothalamus would inhibit the 
sympathetic-adrenal medulla stress re- 
Sponse as well as the Pituitary-adrenal 


cortex reaction, shies both of these 


mediated Dy Thy fesponse to stress are 
44 18 POabetior hy, $ 
(28, 42,46), Dostetior hypottatain 

SM pathetic respo Se: j hi 
Arms tsiially wae "se, Hoth of these 


together, as Sayers 


(55) has pointed Out. Virtually att eon- 
ditions that telease 4 vy ANN) OG Eppe 
aldured stfess ha ~ 


also increase 
sympathetic nervous aelivity and the 


secretion of epinephirine,® 
n accounting for stress damage to the 
cardiovascular and renal systems, the 


Work of Selye (60) has put the onus 
on the pituitary-adrenal response and 
the consequent production of adrenal 
cortical steroids. Recent investigations 
by Goldstein (21) and others have, 
however, shifted emphasis to the sym- 
Pathetic division of the autonomic sys 


* Personaj communication with M. 5: Gohi- 
šicin, 1936, 
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tem as the primary agent in tisstie dam: 
age under stress. For example, adrenal- 
ectomized dogs maintain a precarious 
balance under severe physical stress to 
which they would otherwise succumb, 
providing they have been sympathecto- 
mized or have been treated with drugs 
that block action of the sympathetic 
system (21).° This work of Goldstein 
aid others suggests that adrenal cortical 


hormones have the function of buffering 
the tissues against damage irom epi- 
nephrine and hor-epinephrine. 

Without such preparation of minute 
blood vessels by adrenal steroid hor- 
Manes, autonome enfant} it 
ity appears to result in {eto 5 dam- 
age, and hence probably in the terminal 


conditions of local haemorrhage and Ae- 


creased blood pressure (21). On the 
other hand, interruption of the sym- 


pathetic system protects (he adrenal- 
ectomized animal from stress? Gold- 
stein notes in this connection: “It is as 
if the price paid for a highly reactlve 


system such as the sympathetic is to ex- 


pose the tissue to damage with the 
steroids functioning to support the tissue 
in meeting the metabolic stimulation of 
the adrenalins,” ° 

Other experiments have showä the 
protective effects of sympatheti¢ block: 
ing agents in tumbling shock (52) and 


in tourniquet and burn shock (40), ft 
bal 


may therefore be suggested hole I 
raul aa fist aih aampening ot 
poyi from jyst syeh a dam > 
sympat atte activity tinder stress. This 
a arian ort he relative. ari 
comparison 9 platiy: 

i of early handling a of syni- 
pathetic blocking agents for rats under 
severe physical stress, such as tumbling 
shock. The hypothesis could be indi- 
rectly tested by comparing the startle 


d peripheral athetic response (a8 
fu tall blood ote to sudden stimuli 


j: ats ang controls: 
of arly- handled fibers The major 
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tHéehaniem here suggested t0 account 


for the effects of early handling on via- 
bility remains, however, dampening of 
posterior hypothalamic reactivity (6). 
Inhibition of the sympathetic autonomic 
response, and of the pituitary-adrenal 
response, would be subsidiary effects of 
this change in hypothalamic function. 
This hypothesis could be tested directly 
by aor it kort activity in the 
posterior hypathmemus or sativa: 
dled and control rats ünder physical, 
physiological, and emotional stress. For 
exatiple, the electrical response 1P Epi 
nephrine injection could be compared in 
bandied and control animals: 

Fhe neuro) ealologiea) pectian ions I: 
volved in such postulated posterior hy- 

othalamic dainpening have not been 


investigated. 
Recent experimental work by Redgate 


and Gellhorn (15, 49) suggests, how- 
ever, that balance of excitability be- 
tween the anterior and posterior hy- 
pothalamus determines pulse-rate and 
blood-pressure levels. Coagulation ot 
drug-induced reduction of posterior hy- 
pothalamie excitability resulted in A re- 
duction af blood pressure and ET vale 
Coagulation of the anterior hypettals- 


mus resulted in opposite changes. 
ther research by Shimazu et al, (63) 


in Japan has shown that stimulation of 
parasympathetic hypothalamic centers in 
the i bit inhibits seeretion in the pot: 
terior halamus öl the RUTOSGGES 
vaio Teit nin indieate that 


ihe neurosecretory material i% released 
into the aA ot the pee 
lood vessels by $ ) = 
s sympathetic centers of the hypo- 


thalamus, or by such stresses as wounds, 


burns, oF Poison. 


30 The neurosecretory oie, whi ae 
pural impulses into 

equipped to transform neura 
aoe activity, thus provide the mechanism 
the [wo great Mtegrating systema 


eka body (56); 
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The work of Scharrer (56, 57) and 
Harris (25) has indicated that it is the 
transmission of this material from the 
posterior hypothalamus to the anterior 
pituitary through the portal blood ves- 
sels that initiates the release of ACTH 
from the pituitary and thus triggers the 
pituitary-adrenal response to stress, par- 
ticularly when the noxious stimulus is 
mediated by the central nervous system 
(as is the case for emotional stress). 
The important finding of the Japanese 
investigators, for the present purpose, is 
that stimulation of the parasympathetic 
centers suppresses the release of this 
neurosecretory material and causes its 
retention in the axon. 


Those environmental stimuli that ac- 
tivate anterior and parasympathetic hy- 
pothalamic centers would, therefore, at 
the same time inhibit the posterior hy- 
pothalamic nuclei that mediate the two 
arms of the response to stress. The bal- 
ance between activity of the anterior 
and posterior hypothalamus can thus be 
termed the threshold for stress, The 
more dominant the posterior hypotha- 
lamic centers over the anterior, the 
greater the probability that a neutral 
stimulus would result in a stress re- 
sponse, and thus the lower the threshold 
for this response, 

To account for the effects of early 
handling on viability of the albino rat, 
it can now be Suggested that early han- 
dling stimulates the anterior hypothala- 
mus and parasympathetic centers and 
thus inhibits the posterior hypothalamic 
response to stress. If this sensory input 
from handling occurs early enough in 
the development of the organism, then 
the balance is Permanently shifted in 

favor of the anterior region, 

The relative dominance of one or the 
other of these two response systems, de- 
termined genetically or by early experi- 
ence, can thus be considered the thresh- 
old for stress. That is to say, the prob- 

ability that the rat will respond to a 
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neutral environmental stimulus with a 
protein catabolic reaction is, in part, de- 
termined by the rat’s early sample of 
what life is like for rats. 

The central proposition of this paper 
is, therefore, that early handling alters 
the balance of hypothalamic excitability 
in favor of the anterior (and parasym- 
pathetic) regions, thus having the joint 
effects of (a) reducing pituitary-adrenal 
and sympathetic-adrenal medulla re- 
sponses under stress by the mechanism 
noted above, and (b) stimulating ana- 
bolic processes, such as an increase in 
Secretion of growth hormone from the 
anterior pituitary (handling has been 
found to increase weight and skeletal 
length of rats in a number of experi- 
ments, noted below), through a mecha- 
nism not yet determined. 

The anticatabolic effects of early han- 
dling in reducing pituitary-adrenal and 
sympathetic-adrenal medulla responses 
under stress have been considered above. 
The anabolic effects of early handling 
will now be considered in some detail. 

Anabolic effects. In respect to ana- 
bolic stimulation from early handling, 
the finding of Doolittle and Meade (13) 
that free amino-acid blood-serum con- 
tent of early- and late-handled (Type I) 
rats shows an acute reduction, compared 
to unhandled controls, suggests that pro- 
tein synthesis is increased, and/or pro- 
tein breakdown reduced, in handled ani- 
mals. This indirectly supports the hy- 
pothesis that early handling stimulates 
growth hormone secretion, as suggested 
by Harris (24). 

In this connection, Mogenson et al. 
(41) have found that the weight gain 
usually induced by early handling (Type 
I) is nullified either by injection of 
cortisone (an adrenal cortical hormone) 
or by the stress of a high-frequency 
sound, both effects presumably operat- 
ing to counteract growth hormone or 
reduce its output. Ruegamer and Silver- 
man (54) have shown that rats han- 
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dled (Type I) two weeks preceding elec- 
tric shock, and with resultant gains in 
growth and food utilization over con- 
trols, lost these advantages when both 
groups were subjected to electric shock. 
Both groups were inferior to nonshocked 
controls. 

This effect of stress on anabolic ef- 
fects of early handling might account 
for the failure to obtain weight gains 
from handling (Type I) in the experi- 
ments of Newton (7), Long (7),’* and 
Scott (59). The heat wave that inter- 
vened in the latter experiment may have 
served to wipe out any differences be- 
tween experimental and control groups. 
The Newton and Long results may have 
been related to the exposure of their 
rats to cold in transit from upper New 
York State to Toronto in the winter, 
but this is only speculation. In any 
case, a majority of studies in the litera- 
ture #2 have shown that early handling 
induces weight gain in the rat, and 
Ruegamer et al. (53) have shown this 
gain (from Type I handling) to be the 
result of better food assimilation rather 
than of greater consumption. T his again 


11 Personal communication with H. G. Long, 
1955. 

12 These include an original experiment by 
Bernstein (4), three subsequent experiments 
by Weininger (69, 70), three by Ruegamer 
et al. (53), and one by Brooker (8), all in- 
volving Type I handling. Levine found weight 
gains from handling (Type II) in his glucose- 
injection experiment (34). Both Weininger 
(69) and Ruegamer (53) found that early 
handling (Type I) increased skeletal length. 
In another experiment in this area, McClel- 
land (37) found that stroking the rat (in a 
restraining box) with an artist’s brush for the 
same period had exactly the same effect in in- 
ducing weight gain as personal (Type I) han- 
dling by the experimenter. This latter finding 
supports Weininger’s view (67, 70) that the 
essential variable involved in handling is tactile 
stimulation, rather than supporting Bernstein's 
view (5), suggested to account for his’ findings 
on handling (Type I) and maze learning, that 
the essential element in handling is relation- 


ship of the animal to the experimenter. 


suggests the influence of growth hor- 
mone (somatotrophin or STH). 

How increased growth-hormone out- 
put could lead to enhanced survival un- 
der such specific and severe types of 
stress as complete food and water dep- 
rivation, as found for early-handled rats 
in the studies of Long (7) and Levine 
and Otis (36), is suggested by the re- 
lation of STH to cardiac glycogen depo- 
sition. 

Studies of Adrouny and Russell have 
shown, respectively, that STH increases 
cardiac glycogen under normal condi- 
tions and that it has an important role 
in maintenance of cardiac glycogen 
stores during the emergency of fasting 
(1, 2). The greater survival of early- 
handled (Type I) adult rats (age 52 
days) under food and water depriva- 
tion than of controls of the same weight, 
as found by Long (7), is otherwise 
rather difficult to explain; but it could 
be accounted for on the assumption that 
these animals were able to maintain 
cardiac glycogen (a form of carbo- 
hydrate) stores longer than controls. 
This hypothesis could be experimentally 
checked by comparison of cardiac gly- 
cogen deposits in early-handled and con- 
trol rats (a) under food and water dep- 
rivation and (b) under normal condi- 


tions. 


Tf exploration of the hypothesis that 
early handling alters the balance of hy- 
pothalamic activity in favor of the an- 
terior region should prove fruitful, the 
next logical step would be investigation 
of the neurophysiological determinants 
of such altered neuroendocrine response. 

This would require, in the first in- 
stance, recording of anterior and pos- 
terior hypothalamic activity from chroni- 
cally implanted electrodes, to test the 
specific hypothesis of anterior domi- 
nance. The specific hypothesis to be 
tested here would be that, in early-han- 
dled animals as compared to controls, 
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(a) posterior hypothalamic excitability 
would be reduced, and (b) anterior hy- 
pothalamic excitability would be in- 
creased, under both normal and stress 
conditions. 

The hypothesis of anterior dominance 

could be indirectly tested by comparing 
pulse rate and blood pressure of early- 
handled and control rats, under both 
normal and stress conditions. The work 
of Redgate and Gellhorn (49), discussed 
above, suggests that if the anterior hy- 
pothalamus is dominant in early-han- 
dled rats, as hypothesized, these animals 
should evidence, compared to controls, 
(a) a slower return of blood pressure 
from drug-induced hypotension, and (b) 
an increase in reflex slowing of the pulse 
from nor-epinephrine. The most con- 
clusive experiment in respect to anterior 
dominance would, however (as suggested 
above), involve application of a stress 
agent to the organism (epinephrine, for 
example) and consequent recording of 
hypothalamic electrical activity, compar- 
ing early-handled and control rats in 
this respect. 


MODULATION or HYPOTHALAMUS 


Amygdaloid complex. Ifa change in 
the balance of hypothalamic activity is 
suggested to account for effects of early 
handling on viability, then the question 
arises as to just how such a change could 
be effected. Recent studies by Gloor 
(19, 20) and Pribram (47) of the amyg- 
daloid complex? and its diffuse pro- 
jection system to subcortical regions 
suggest that one of its functions is 


While it 
j the olfactory sec- 
ondary centers, it has a good many nonolfac- 


tory connections, particularly somesthetic (39), 
and may be thought of as an olfactory thala- 
mus or integrating center. Cf, Strong and 
Elwyn (64), Ranson and Clark (48), 


to modulate hypothalamic excitability. 
Other research by Schreiner and Kling 
(58) has shown that removal of the 
amygdaloid complex reduces behavior of 
both savage and domestic animals to 
tameness and docility. The primitive 
function of the olfactory cortex, of 
which the amygdaloid complex is an 
element, can be thought of, after Her- 
tick (26), as qualitative discrimination 
between desirable and noxious stimuli 
and consequent nonspecific arousal of 
the appropriate subcortical centers, par- 
ticularly hypothalamic, 

If the function of the amygdaloid 
complex includes discrimination between 
desirable and noxious stimuli, then its 
removal should interfere with such dis- 
crimination. Relevant here is the find- 
ing of Weiskrantz (71) that the effect 
of amygdalectomy on monkeys js to 
make it difficult for the animals to 
identify motivationally relevant stimuli 
(in the words of J. Olds), whether de- 
sirable or noxious, 

A study by Lesse (30) has shown that 
stimulus situations involving behavioral 
responses of agitation, escape, attack, or 
defense induce in the cat a distinct elec- 
trographic pattern (high amplitude, 40- 
45 sec. frequency) in the lateral nuclei 
of the amygdaloid complex. 

Since these behavioral responses are 
precisely those that Hess (27) has ob- 
tained from direct electrica] stimulation 
of hypothalamic centers (also in the 
cat), a connection between amygdaloid 
and hypothalamic response to stress 
would seem indicated here. It should 
also be noted from the Lesse study that 
stimuli which resulted in waking or 
alerting responses only, and which have 
been found in other studies to produce 
large-amplitude, synchronized waves in 
the hippocampus (22), did not induce 
the amygdaloid EEG pattern typically 
elicited by noxious stimuli, such as elec- 
tric shock. 


Indirect 


evidence for amygdaloid 
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modulation of hypothalamic reactivity 
comes from de Florida and Delgado’s 
finding (10) that stimulation of the 
amygdaloid nucleus induces affectionate 
and playful behavior in a group of male 
cats, with effects lasting for several days. 
An experiment of Morgane and Kosman 
(43), however, contrary to the trend of 
the above studies, suggests that the 
amygdaloid complex has no effect on 
hypothalamic reactivity. They found 
that bilateral amygdalectomy in cats 
produced no observable changes in af- 
fective behavior, including reaction to 
noxious stimuli or petting. 

The question remains as to how the 
sensory input from early handling could 
affect amygdaloid activity. Relevant in 
this connection is the study of Machne 
and Segundo (39), who found stimula- 
tion of the sciatic nerve and touching of 
any portion of the skin most effective 
in arousal of electrical activity in the 
amygdaloid nucleus. From this study, 
olfacto-somatic correlation appears to be 
a major function of this subcortical 
center. 

The above work, taken together, sug- 
gests that the balance of hypothalamic 
activity, and therefore the threshold for 
stress, may be determined by amys- 
daloid mediation. The testing of this 
hypothesis would require removal of the 
amygdaloid complex by suction in ex- 
perimental and matched control animals 
before handling, to determine whether 
the effects of handling on viability 
would thereupon be eliminated. Fur- 
ther, electrical recordings of amygdaloid 
activity during stress and under normal 
conditions could be made to determine 
any differences between early-handled 
and control animals. 3 

Hippocampus. The role of the hip- 
pocampus (see Fig. 1) in modulating 
hypothalamic activity also requires €x- 
ploration. Relevant in this respect are 
(a) the finding of Gloor (19) that hip- 
pocampal responses to electrical stimu- 
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lation of the amygdaloid nucleus reach 
the largest amplitudes of any structure 
fired by the amygdala; and (b) a re- 
ported finding of J. W. Mason * that 
electrical stimulation of the hippocam- 
pus itself acts to suppress the pituitary- 
adrenal response to stress. 

These findings would suggest, then, 
that the amygdaloid complex may have 
at least two pathways for hypothalamic 
modulation—a direct projection to the 
anterior hypothalamus via the stria 
terminalis, with direct fiber connections 
(19), and an indirect, multisynaptic 
pathway to the hippocampus, whose 
main efferent projection (the fornix) is 
to the posterior hypothalamus, in the 
region of the mammillary body (see 
Fig. 1). This is, of course, precisely 
the region found by Porter (45, 46) to 
mediate the pituitary-adrenal response 
to stress. 

Delgado (11) found defensive reac- 
tions in monkeys following electrical 
stimulation of the hippocampus and 
fornix, and, on the basis of his own and 
other work, suggests the hippocampus as 
locus for nociceptive sensory integra- 
tions. These and other investigations 
impute to the hippocampus a leading 
role in the mediation of noxious stimuli 
and hence in the neuroendocrine re- 
sponse to stress, although no simple or 
clear-cut picture of hippocampal func- 
tion emerges from this research. 


SuMMARY 


The central proposition of this paper 
is, therefore, that effects of early han- 
dling on viability of the albino rat are 
mediated by a permanent alteration in 
the balance of hypothalamic activity, in 
favor of the anterior region. This al- 
teration results in increased growth hor- 
mone output under normal conditions, 
and decreased activity of the pituitary- 


14 Personal communication with J. V. Brady, 
1957. 


268 Everett W. Bovarp 


o o 


Fic. 1. Diagrammatic representation of the main telencephalo-hypothalamic connections: 
The arrows indicate the direction of nervous impulses in this system. The arrows in the dia- 
gram of the hemisphere outline the main associational connections at the cortical level, which 
are relevant for this system, (From Gloor, P, Telencephalic influences upon the hypothala- 
mus. In W. S. Fields, R, Guillemin, & C. A. Carton (Eds.), Hypothalamic-hypophysial inter- 


relationships. Springfield, TIl.: Charles C Thomas, 1956. Pp. 74-113. Courtesy of the author 
and Charles C Thomas, Publisher.) 

A: amygdala. A6: Area 6 of frontal lobe. E: entorhinal area (pyriform lobe). 
D: dorsomedial nucleus of hypothalamus. F: fornix. 
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adrenal cortex and sympathetico-adrenal 
medulla systems under both normal and 
stress conditions. In turn, this altera- 
tion in hypothalamic balance is itself 
the result of a change in amygdaloid 
complex activity, arising from the sen- 
sory input from early handling. 

Experimental tests of derivative hy- 
potheses have been suggested in the 
course of this presentation. 
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FUNDAMENTAL DIMENSIONAL PROPERTIES OF 
THE OPERANT? 


THOMAS F. 
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At the outset of any inquiry into be- 
havior, and prior to the introduction of 
experimental variation, the investigator 
prepares to perform two classes of op- 
erations. In the first place he locates 
and circumscribes, in space and time, 
the subject matter with which he is con- 
cerned. Secondly, he designates some 
specific dimension or dimensions along 
which quantities of his subject matter 
can be established. It is the purpose 
of this paper to describe the simplest set 
of operations that can be used to es- 

tablish quantities of behavioral subject 
matter. However, the acceptability and 
meaning of any description of a funda- 
mental dimensional set will depend, in 
part, upon what unit of subject matter 
is chosen for purposes of quantification. 
To put it simply, kow one sets about to 
measure largely depends upon what one 
sets about to measure, 

Attempts at systematic designation of 
behavior dimensions have been hamp- 
ered by the confusion about just what 
events enter into the subject matter 
which the dimensions are to describe. 
If we look at what measures behavior- 
ists actually use we see an indefinite 
number of dimensions, Latency, resist- 
ance to extinction, habit strength, am- 
plitude, speed, accuracy, threshold, fre- 
quency, reaction-potential, aggression, 
mels, VTE, rate, and anxiety are only 
the beginning of the list of the most 
common “dimensions.” Jf we reflect 
upon these many different measure- 
ments we should find that many of 
them are, for the sake of measurement, 
reducible to others. For an example, 
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resistance to extinction, as a “measure” 
of reflex strength, is established by 
measuring frequency, amplitude, or la- 
tency after reinforcement has been with- 
drawn. Resistance to extinction is a 
phrase we use to describe changes in 
behavior, measured along some dimen- 
sion, as a function of a certain kind of 
experimental variation. In short, re- 
sistance to extinction designates a func- 
tional relationship. Threshold is an- 
other case in point. We do not “meas- 
ure” threshold, but we measure the 
frequency with which one says, “yes, 
I see the light,” and we describe how 
the frequency of this behavior is func- 
tionally related t 
in the experimen 
sometimes, 

fact that this functional relationship js 
nterest the two 
Seldom, in sci- 


ct the variables 
This is no ground 
nclude that basic 
psychological measurement dimensions 
do not have the characteristics of ratio 
scales, 

It is the first business of science to 
describe functional relations. Tt Carries 
out this business by introducing experi- 
mental variation and measuring the con- 
current changes in the subject matter. 
In order to describe these functional 
relations, some Operations of measuring 
the subject matter necessarily are estab- 
lished prior to the introduction of the 
experimental variations. In any science, 
functional relationships exist in indefi- 
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nite numbers, while measurement di- 
mensions do not. 

The operations of measuring behavior 
and the operations of establishing func- 
tional relations must be independent of 
each other, lest confusion is to arise. 
The procedures we use to describe be- 
havior are established at the beginning 
of inquiry and cannot change unless the 
inquiry itself is redirected. Descriptive 
procedures are required to be independ- 
ent of experimental variation, since these 
procedures are utilized for describing the 
effects of the experimental conditions. 
This is not to say that the quantities 
obtained from descriptive procedures 
must be independent of the experi- 
mental variables; rather, the procedures 
will be independent if the dependence 
of the quantities is to have meaning. A 
set of measurement dimensions is es- 
tablished for the purpose of describing 
functional relationships and will, there- 
fore, not include the very functional re- 
lationships to be described. 

A second difficulty in dimensional 
analysis lies in the confusion about 
what is meant by dimensional independ- 
ence, or unidimensionality. The precise 
distinction that it is necessary to make 
at the outset is that each member of 
the fundamental set represent an inde- 
pendent procedure. This does not mean 
that the outcomes of measurement along 
different dimensions will be uncorre- 
lated. It simply means that there is 
nothing in the procedures themselves 
that necessitates correlation. When we 
ask how many dimensions a house pos- 
sesses, we do not first obtain data on 
the house; rather we first decide along 
what dimensions houses are measurable, 
regardless of the particular quantities 
obtained in the measurement of any one 
house. We say that a house is multi- 
dimensional simply because there 1S 
more than one procedure by which we 
can measure it. A dimension implies 
an operation for obtaining data inde- 


pendent of the particular quantities ob- 
tained on one or more occasions. How- 
ever, in the study of behavior, dimen- 
sional analysis is frequently thought to 
be a procedure for ordering data already 
obtained independently of the methods 
used to obtain it. Thus, we let the uni- 
dimensional characteristics of a set of 
data hinge upon the data’s being un- 
correlated with some other set of data. 
It is not surprising that “unidimension- 
ality” has been hard to come by. It is 
a well known fact that the physical di- 
mensions of mass and extension are fre- 
quently correlated in nature. Neverthe- 
less, we think of them as independent 
dimensions because they are operations 
of measurement that do not necessitate 
correlation. They are, as I shall say, 
logically independent, although not nec- 
essarily empirically independent. 

The analysis that follows does not, 
then, refer to operations for establish- 
ing quantitative functional relationships, 
nor of operations for establishing the de- 
gree of empirical correlation between a 
set of measures already obtained. It re- 
fers, instead, to the simplest set of op- 
erations for obtaining measurements of 
the behavior of organisms. It seems 
clear that those who investigate behav- 
ior typically investigate that class of 
behaviors that Skinner has called op- 
erant (8, p. 65). Skinner's well known 
definition is suitable for the purposes of 
this paper. The dimensions set forth 
here will, I think, be relevant to the 
class of respondents as well, although I 
am not fully satisfied that this is so. 

Once the behavior is specified as an 
operant, it still has no empirically de- 
termined quantity. A definition of the 
operant denotes operations that yield its 
qualitative characteristics and establish 
it as subject matter. There remains the 
problem of describing the simplest pos- 
sible operations that can be performed 
in the determination of empirical quan- 
tity. These operations, once they are 
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specified, I shall call primary dimen- 
sions. Primary dimensions will be those 
operations of measuring that yield quan- 
tities that are of an irreducible sim- 
plicity. These quantities will also pos- 
sess the characteristics of ratio scales. 
Now these quantities will all be de- 
rived by the physical operations of the 
measurement of time and space, and can 
be reduced to these operations; but in 
being so reduced, their behavioral con- 
venience will be lost. They are irre- 
ducible only in the sense that further 
reduction takes the quantities out of the 
realms of behavioral discourse. So long 
as we regard the phenomena in the 
realms of psychological investigation as 
physical and extensible, as I shall do 
here, measurements in time and space 
are called for. As Bridgman (1) has 
said, dimensions are matters of con- 
venience so long as they are compatible 
with the facts and possess the charac- 
teristics of ratio scales. Time measure- 
ments are themselves extensible, and can 
be measured by the distance a hand 
travels over a clock face. This does not 
remove the fact of the convenience of 
time as a primary dimension in physics. 
The dimensional operations I shall 
specify here are not given as the abso- 
lute and “real” character of behavioral 
data, but are offered as convenient 
methods for assigning quantities to be- 
havioral phenomena. The final test of 
their fundamental properties will be in 
the consequences that accrue from their 
use in the laboratory, As I shall de- 
scribe them, they will be designations of 
certain operations that can be carried 
out by the investigator, who has estab- 
lished an operant as his subject matter. 
They will not refer to specific charac- 
teristics of behavioral data, but to means 
by which these characteristics can þe de- 
scribed. It should also be obvious that, 
if these dimensions can be designated as 
more fundamental than other procedures 
of measurement, these other procedures 


must be reducible to the dimensions de- 
scribed here. 


THE DIMENSIONS 


The task of outlining the nature of 
the dimensions that I consider primary 
may be facilitated by imagining a sim- 
ple laboratory situation. A human sub- 
ject is brought into a room and seated 
before a panel on which is mounted a 
plunger that can be pushed against a 
spring resistance. The plunger is so 
built that, if it is released at all, it will 
return to its normal or starting position 
before it can be pushed again. The 
plunger is mounted on a swivel so that 
the direction of the push is variable. A 
cumulative recorder is attached to the 
plunger so that any movement is re- 
corded. The subject is instructed to 
push the plunger as often as he likes, 
and told he may earn money for his 
work. He is told to begin when the 
experimenter turns on a light and that 
he can stop working whenever he likes. 
The fixed reference space, in this case, 
is the plunger, and the operant is 
plunger pushing. 


Temporal Dimensions 


Latency. The first elementary meas- 
urement operation is that of recording 
the time between the opportunity for an 
operant and its initial occurrence. The 
measure is stated in time units, and can 
be easily obtained from a cumulative 
recorder or a timer that goes into op- 
eration when the plunger is made avail- 
able and cuts off when the plunger is 
first pushed. 

Tempo. Once an operant occurs it is 
frequently repeated, and there is a uni- 
tary quality about the sequence of op- 
erants as they are being repeated. For 
example, the child who pedals a bicycle 
evidences no break between pedaling 
through one revolution and another, and 
one operant leads smoothly into another. 
The rate of this continuously ongoing 
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operant is tempo. In the plunger-push- 
ing example, tempo can be easily meas- 
ured as the slope of the cumulative 
curve during those times in which S is 
working continuously. This measure- 
ment is made quite easily where these 
slopes are linear, which is more often 
the case than not. A decision must be 
made about what is to be considered 
continuous or discontinuous; however, 
this decision is taken care of in the con- 
sideration of the next dimension. 

The measure of tempo as the rate at 
which the operant is being emitted, ex- 
clusive of any breaks in its emission, is 
dependent upon the duration of the in- 
dividual operants in the run. The tempo 
of a single operant is its duration. In 
order to distinguish between the dura- 
tion of the operant and the duration of 
the time in which the organism is emit- 
ting operants, I shall borrow a term 
from physics. We may refer to the 
duration of a single instance of an op- 
erant as the period of that operant. 

Perseveration. The third temporal di- 
mension can be seen where we regard 
the amount of time the organism spends, 
after latency, at the prescribed task. If 
we regard the total time from the oc- 
currence of the first operant to the com- 
pletion of the last, the proportion of this 
time in which operants are actually oc- 
curring is perseveration. Examples of 
this type of variation are numerous. 
For example, after some continuous 
plunger pushing or bicycle pedaling, 4 
person will pause and engage himself 
in some other task before continuing 
with the original one. Just how long 
the pause must be before we decide that 
the task has been discontinued is a® 
arbitrary matter. This question creates 
very little difficulty in normal labora- 
tory situations, however. In the plunger 
example, perseveration is measued by 4 
timer that is in continuous operation 


while S is pushing the plunger. ES 


pauses for as much as five seconds, & 


delay relay shuts off the timer until S 
resumes work. Perseveration can also 
be calculated directly from cumulative 
tracings. The five-second pause as a 
criterion for deciding that S has quit the 
task is arbitrary, but is derived, in this 
case, from observations of how long it 
takes the slowest S to complete the cycle 
of one plunger push. 

Duration. The fourth temporal di- 
mension is the total time from the ini- 
tiation of the first operant to the com- 
pletion of the last, independent of the 
amount of work done within this inter- 
val. The designation of the completion 
of a sequence of operants will vary from 
investigation to investigation. It may 
be in the form of completing a task such 
as filling up a ditch, reaching some cri- 
terion of satiation, or returning to Op- 
erant-level of output. In the plunger 
example, it is simply the time from the 
end of latency until S pushes a switch, 
thus indicating his desire to quit. 

Figure 1 is a schematic presentation 


TLdP TidP 


Fic. 1. A schematic presentation of varia- 
tions along four temporal dimensions, illus- 
trating the independence of each. The figures 
cumulative recordings. Upper case letters 
nt: fast tempo (T), short latency (L), 
and high perseveration (P). 
resent: slow tempo (t), 
ration (d), and low 


are 
represe 
long duration (D), 
Lower case letters rep 
Jong latency (1), short du 
perseveration (p)- 
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of hypothetical results from the plunger- 
pushing experiment. The figure repre- 
sents several cumulative tracings, the 
ordinate being a frequency scale and the 
abscissa a time scale. As can be seen 
readily, the total work output may vary 
as a function of each of these dimen- 
sions, and the variation in each dimen- 
sion is logically independent of varia- 
tions in any other dimension. 


Output 


Total output data will be a function 
of each of these four temporal dimen- 
sions. The dimensions combine in the 
following manner to yield a total out- 
put quantity (where it can be assumed. 
that tempo is linear) : 


R=KTPD —L, 


where R is output in frequency; P is 
perseveration in proportion of total time 
less latency; D is duration in seconds; 
L is latency in seconds; T is the slope 
of the cumulative recording where the 
animal is at work; and K is a correction 
for different units of measurement of 
frequency on the ordinate and seconds 
on the abscissa. KT is the number of 
operants occurring in a second during 
the times in which the animal is actually 
working. It should be noted that the 
spatial dimensions also enter into the 
determination of total output. Here we 
have considered the case in which these 
dimensions are held at some fixed value. 
In the special case in which the task 
is of some fixed length, duration natu- 
rally will be determined by persevera- 
tion and tempo. In this event, time 
rather than output measures are com- 
monly used. It can be seen that time 
is a complex measure, determined by 
the operation of several dimensions, 


Spatial Dimensions 


A most obvious and common measure 
of an operant in space is what has been 
called amplitude or magnitude. How- 
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ever, the behavioral concept of ampli- 
tude may involve two independently 
measurable characteristics. For this rea- 
son I have chosen two names that, while 
not unusual, are seldom used in behav- 
ioral description. 

Intension. The operation that is usu- 
ally indicated by the word “amplitude” 
is the measurement of the amount of 
movement that takes place in some 
specified part of the organism’s topog- 
raphy. This measure of operant ampli- 
tude I shall call intension, and it is 
taken when we regard the amount of 
movement that some specified part of 
the animal makes with reference to some 
other space. In the plunger experiment, 
the measure is simply made by record- 
ing the distance through which the 
plunger moves from the normal position. 
Intension must not be confused with the 
next measure of amplitude. 

Extension. Independently of the 
amount of movement of a specified part 
of the animal, we may measure the 
amount of the animal involved in the 
movement. A simple illustration of the 
distinction between intension and exten- 
sion is made by imagining a hand laid 
palm down on a table surface. We may 
regard the distance through which a 
finger (or fingers) may be lifted (inten- 
sion), or we may regard, independently 
of the distance through which they are 
moved, whether one or more fingers are 
lifted (extension). Russell et al. (5) 
have made a somewhat similar distinc- 
tion in an analysis of quantitative pro- 
cedures. I believe that what I call ex- 
tension is similar to what they have 
called “quantal intensity.” A precise 
terminological comparison is hardly fea- 
sible, since our methods of analysis 
differ rather sharply. The measure- 
ment of extension is difficult to make 
in many behavioral situations, includ- 
ing the plunger apparatus. We may de- 
termine whether the plunger is pushed 
with one hand or two, or we may make 
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arbitrary divisions of the topography 
and measure the involvement of these 
divisions by muscle action-potentials. 

Direction. The third spatial dimen- 
sion is so well known as to require little 
elaboration. This dimension is estab- 
lished by the measurement of the direc- 
tion of movement with respect to some 
fixed reference space. The operation by 
which this measure is taken from the 
swivel-mounted plunger should be ob- 
vious. Accuracy, error, and “prefer- 
ence” are typical expressions of the di- 
rection dimension. 


Two EXPERIMENTAL EXAMPLES 
OF THE ANALYSIS 


Such an analysis as the one offered 
here should have empirical consequences 
to establish its worth to an empirical 
science. I will briefly describe a few 
representative results from two investi- 
gations that may serve to argue for the 
more widespread usage of this analysis 
as well as to indicate the empirical 
properties of several of these dimen- 
sions.” 

The first experiments represented pro- 
cedures similar to those of Guttman (3). 
Although no attempt was made to repli- 
cate Guttman’s work, our data are not 
very different from his findings. Essen- 
tially we allowed fully trained rats to 
receive .05 ml. of a sucrose solution from 
a tube upon pressing a bar (Guttman 
used .005 ml.). One of the major ex- 
perimental variables was the concentra- 
tion of sucrose in solution (4, 8, 16, and 
32%). An important feature of the ap- 
paratus was the fact that the animal re- 
ceived a drop directly into his mouth as 
he pressed the bar, and he never had to 
move away from the bar. In many 
Skinner boxes, the animal moves away 
from the bar and the operants cannot 


2 The data of the first of these experiments 
were obtained in collaboration with Lloyd 


Hitchcock, Jr. (4)- 


occur with the continuity required for 
the measurement of tempo. 

From rather fast-moving cumulative 
recorders, the temporal dimensions listed 
above were measured. Figure 2 sum- 
marizes and schematizes the findings. 
Each of the temporal dimensions is 
plotted as a separate function of the 
sucrose concentrations. Figure 2E rep- 
resents the total output as a function of 
sucrose concentration, and the general 
form of this curve is similar to that 
found by Guttman. However, we can 
see that the total output is analyzable 
into four contributing dimensions, each 
being a different function of the experi- 
mental variable. 

Tempo. This is an invariant, remain- 
ing the same throughout the range of 
concentrations (Fig. 2C). It was also 
discovered that tempo did not change as 
the animal approached satiation; a fact 
that is supported by earlier findings in 
this laboratory, as well as by the data 
of Skinner (7), Young (9), and Sidman 
and Stebbins (6). This invariant char- 
acter of tempo seems to be true for both 
PR and continuous schedules. The gen- 
eralization is that, when the animal 
works at a task, he works at a tempo 
that is characteristic of him and that is 
unaffected by the nature of his rein- 
rcers or the extent of his deprivation. 
Another characteristic of tempo seems 
to be its moment-to-moment invariance. 
In our laboratories we investigated the 
consistency of tempo from day to day 
and from time to time within a day’s 
run. Reliability coefficients, represented 
by linear 7s, are in the neighborhood of 
.98.° There is great variability in tempo 
between animals, and negligible variabil- 
ity within animals from time to time. 


fo 


2 This work was done with W. T. James of 
the University of Georgia, and reported to the 
Southern Society of Philosophy and Psychol- 
ogy, Atlanta, April, 1954. James is primarily 
responsible for the design of the liquid rein- 
forcing apparatus which was used. 
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Fic. 2. Data illustrating how output of bar- 
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Latency. In the sucrose investigation, 
it was discovered that latency is a nega- 
tively accelerated, decreasing, monotonic 
function of reinforcing concentrations 
(Fig. 2A). This is a finding supported 
by too many investigations to need men- 
tioning. The well known variability in 
latency measures was also observed here. 

Perseveration. This was found to be 
an increasing, negatively accelerating, 
monotonic function of sucrose concen- 
tration (Fig. 2B). Actually, variations 
in perseveration determine the greater 
part of the variations in total output, 
and seem to be the most sensitive and 
reliable indices of performance variabil- 
ity. The reliability of these measures 
is quite high. As the animal approaches 
satiation, he perseveres less and less at 
the task, and this fact probably deter- 
mines the negative acceleration of most 
work curves. 

Duration. In the sucrose reinforce- 
ment experiment the animals were al- 
lowed to work a fixed time, so that all 
animals did not reach a reasonable cri- 
terion of satiation. However, estimates 
from these data combined with other 
data taken in our laboratory suggest the 
following conclusions: Duration is a 
positively accelerating, decreasing, mono- 
tonic function of sucrose concentration. 
The greater the concentration of sucrose, 
the earlier does the animal become sati- 
ated (Fig. 2D). However, as depri- 
vation is increased, duration becomes 
greater—a fact that should be disturb- 
ing to anyone who would point to the 
similarity of the effects of drive and re- 
inforcement on the form of total output 
curves. 

A brief summary of data from the 
human plunger-pushing apparatus, de- 
scribed above, may serve to indicate 
some of the generality of these pro- 
cedures and their outcome. The human 
subject is requested to push the plunger 
to work for points. He receives one 
point after the first 200 plunger pushes, 
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Fic. 3. Sample cumulative tracings illus- 
trating the constancy of tempo and the way 
in which total output is determined by varia- 
tion in different dimensions. A represents 
three human Ss pushing a plunger to earn 
grade points. B represents one rat working 
for two different concentrations of sucrose. 


and then no more. He is allowed to 
work as long as he wishes. Figure 3A 
represents the cumulative recordings of 
this work for three different Ss. Fig- 
ure 3B represents the curves of a typi- 
cal rat working for different amounts of 
sucrose. As can be seen, tempo is in- 
variant in all Ss, and tempo, persevera- 
tion, and duration all serve to determine 
the final output.* 


Some IMPLICATIONS or THIS ANALYSIS 


Consideration of Statistics 

It is commonly believed that statisti- 
cal analysis is a necessity in the be- 
havior laboratory. A few investigators, 
such as Skinner, have presented persua- 
sive arguments to the contrary. In spite 
of these arguments, however, data are 


4Skinner (7) once demonstrated the use of 
a water operant. The curves he presents are 
similar to those here, and the constancy of 
tempo can be readily seen, although his tech- 
nique was slightly different. He rejected the 
water operant partly on the grounds of the 
irregularity of the curves obtained. The ir- 
regularity indicated breaks in the work pe- 


riods. 
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often so variable as to appear under- 
standable only by way of statistical 
analysis. It is usually assumed that this 
great variability is produced by uncon- 
trolled and unknown “random” condi- 
tions in the laboratory, and that statisti- 
cal analysis allows us to estimate and 
account for these “random” effects. I 
shall not deny the grounds for the va- 
lidity of this assumption; however, I be- 
lieve that this is a much overworn argu- 
ment. I contend that the compound 
nature of much of our measurement is 
an important source of data variability. 
In the experiment described above, we 
obtained curves shaped like those of 
Guttman (3). However, with twelve 
animals a statistical analysis yielded no 
significant Fs. Guttman had to use 
several dozen animals to obtain Fs that 
were only moderately significant. When 
we analyzed the perseveration scores 
separately, there was no question about 
statistical significance. Such significance 
was found using the analysis of vari- 
ance, but it was quite obvious without 
the use of statistics. This was more re- 
markably the case with tempo scores. 
If, as I have contended, an animal’s 
tempo is invariant under a wide range 
of experimental conditions, it can only 
serve to attenuate differences in total 
output scores produced by experimental 
variation. If an animal spends most of 
his time at a task, and if he works at 
a tempo unaffected by the experimental 
conditions, the tempo will be the great- 
est determinant of the total output 
measure. The output score will then 
consist of a component reflecting not ex- 
perimental variations but great interin- 
dividual differences. In the language of 
statistics this could only serve to in- 
crease the “error term.” 


Misleading Characteristics of Total Out- 
put Measures 


To exemplify the greater need for the 
type of analysis I have proposed, an 


experiment by Fink and Patton (2) 
will be scrutinized. These investigators 
studied the effects of changes in the 
population of stimuli on the decrement 
of conditioned drinking behavior. Rats 
were taught for five days to drink from 
a burette. On the sixth day various 
kinds and degrees of stimulus changes 
were made in the drinking box, and 
measurement was made of the water in- 
take during the first two minutes fol- 
lowing the stimulus changes. It was 
concluded that “the greater the number 
of stimulus components changed, the 
greater the response decrement” (2, pt 
27). The findings of this experiment 
are interpreted by the authors as giving 
support to the S-R contiguity theory. 
The authors note that their measure 
could reflect either rate or amplitude, 
and point that that the Estes-Burke be- 
havior models have not yet made differ- 
ential formulations for rate and ampli- 
tude. However, it is important to notice 
that rate is not, itself, a simple measure, 
Should we assume that their data re- 
flect no changes in amplitude, the inter- 
pretation would remain quite difficult. 
The decrement in the drinking reflex 
could be due to changes in latency. 
Perhaps the animals were “startled.” If 
the decrement did not reflect latency 
changes, but decreasing tempo, the 
meaning of the findings should be con- 
siderably changed. Similarly, changes 
in perseveration or duration would be 
reflected in the total output during the 
two minutes and would deserve different 
interpretations, If “amplitude” ch 
account for the decrement, 
shown whether these changes were in 
intension or extension. Precision in 
mathematical and verbal formulations 
of theory must always follow precision 
in the measurements on which they are 
based. Otherwise, the precision is spe- 
cious indeed. 

The Fink and Patton experiment is 
only illustrative. The amount of pos- 


anges 
it must be 
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sibly misleading output data appearing 
in behavioral journals would require a 
separate review. Experimentation on 
work may serve as another general ex- 
ample. Data from work experiments 
are notoriously variable and the uncon- 
tradicted conclusions about this field are 
few. Output measures are favored, and 
they reflect each of the proposed dimen- 
sions as they combine to yield a total 
score, Precise answers about, say, opti- 
mum rest periods will await the study 
of the effects of these rest periods at 
different tempos, perseverations, dura- 
tions, intensions, and extensions. Simi- 
larly, questions about optimum tempo 
will have answers dependent upon the 
relationship of tempo to perseveration, 
duration, extension, etc. 


A Word About “Mental Tests” 


Intelligence test scores are Com- 
pounds, not only of measurements of 
different operants, but of various dimen- 
sions of several of the separate operants. 
Thus, the Wechsler-Bellevue Digit Sym- 
bol test score will vary as the examinee 
perseveres during the task, as a func- 
tion of his tempo, as a function of la- 
tency, as a function of direction (ac- 
curacy) and perhaps as a function of 
intension and extension when he uses 
the pencil. Only duration is fixed. Two 
identical Digit Symbol scores may have 
entirely different meanings as 4 result. 
Whatever questions concerning the con- 
struction of ratio-scales of measurement 
exist, one thing seems clear: If scores 
are constructed from the undifferenti- 
ated combinations of seven different 
ratio scales, no one may reasonably 
hope that the result will be a ratio 
scale, regardless of what further opera- 
tions are performed upon these scores. 
No two IQs are ever likely to mean the 
same things when such measurements 
are uncritically combined. Even when 
the tests seem to meet the criteria of 
ratio scales, as the Stanford version © 


the Binet-Simon so nearly does,’ the 
comparisons are still meaningless in any 
but the crudest sense. This is not be- 
cause the tasks required are different, 
since they can all be considered as be- 
longing to the same task class; it is be- 
cause the tasks are measured differently 
or by compound procedures. Equal 
things, measured differently, are not 


equal. 
SUMMARY 


Seven dimensions of an operant were 
described as fundamental in the sense 
that other operations of behavior meas- 
urement are reducible to these seven 
and the seven are not further reducible 
within the realms of behavioral dis- 
course. Certain operating assumptions 
were outlined as a basis for the analysis. 

The seven dimensions are (a) latency, 
defined as the time between opportunity 
for an operant and its initial occurrence; 
(b) tempo, defined as the rate of emis- 
sion of a continuously ongoing operant, 
or the period of a single operant; (Q) 
perseveration, defined as the proportion 
of the time from the occurrence of the 
first operant to the completion of the 
last in which operants are actually oc- 
curring; (@) duration, the total time 
from the initiation of the first operant 
to the completion of the last; (e) in- 
tension, the amount of movement that 
takes place in an operant involving some 
specified part of the organism’s topog- 
raphy; (f) extension, the degree of the 
topography involved in an operant; and 
(g) direction, i.e. of the organism’s 
movement with respect to some fixed 
reference point. 

Empirical characteristics of several of 
these dimensions were described. An 
experiment, partly modeled after Gutt- 


5Ẹ. E. Cureton, personal communication. 
Cureton has shown how this test has nearly 
equal SDs from the ages of 3 to 13, and how 
the scale, if extrapolated, places zero M.A. at 


birth. 
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man’s (3) work on sucrose as a rein- 
forcer, was performed to indicate how 
each of several dimensions may be a 
different function of an experimental 
variable, and combine to give a peculiar 
output function. Implications of the 
analysis were discussed, with particular 
reference to the pitfalls of uncritically 
combining the dimensions. 
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In recent years considerable attention 
has been given to the question of how, 
and to what extent, learning may influ- 
ence perception. Most frequently this 
question has been answered by recourse 
to intervening constructs such as organi- 
zations, hypotheses, assumptions, sche- 
mata, etc., as mediators of changes in 
the perception of a given stimulus as a 
function of experience. This approach 
has been aptly termed by Gibson and 
Gibson (16) the “enrichment theory,” 
since it takes “for granted a discrep- 
ancy between the sensory input and the 
finished percept and . . . aim[s] to ex- 
plain the difference. . . . The develop- 
ment of perception must then neces- 
sarily be one of supplementing Or in- 
terpreting or organizing” (16, p. 33). 
Against this view the aforementioned 
authors oppose an interpretation of their 
own, emphasizing progressive differentia- 
tion of the stimulus as the essential ele- 
ment in perceptual learning. According 
to this “specificity theory,” 


_ we learn to perceive in this sense: that 
percepts change over time by progressive 
elaboration of qualities, features, and dimen- 
sions of variation. . - - Perceptual learning, 
then, consists of responding to variables of 
physical stimulation not previously responded 
to (16, p. 34). 


This thesis, as elaborated in their pa- 
per, did not long remain unchallenged. 
In an extended critique by Postman 
(30), it was argued that the specificity 
theory begged the question of learning; 


tten in part during the 
author’s tenure of a NSF postdoctoral fel- 
lowship at the University of Geneva. The au- 
thor is indebted to Jan Smedslund for a criti- 
uscript. 


1This paper was wri 


i.e., of kow the increased differentiation 
of the stimuli was mediated so as to re- 
sult in differential responses. In this 
connection Postman maintains that “the 
changes in response are part and parcel 
of the problem of perceptual learning. 
The need to account for changes in re- 
sponse inevitably endows the problem of 
perceptual learning with an associative 
component” (30, p. 442). But even 
taking the specificity hypothesis as a 
descriptive generalization of the results 
of perceptual learning, Postman argues 
that it is of rather limited generality: 
for instance, it cannot account for the 
perception of signs and symbols. 

It seems impossible to evaluate these 
criticisms without some more clearly 
specified criterion of perceptual learning 
than is provided by either of the par- 
ties to this dispute. Thus the question 
whether perceptual learning is a matter 
of attaching responses to stimuli de- 
pends on the nature of the S-R relation- 
ship that is to be learned. In a typi- 
associate learning task—e.8., 
learning, to identify a set of unfamiliar 
faces by name—it is evident that the 
association of the correct response plays 
a major part in the learning process. 
This is not to deny, of course, the pos- 
sible role of perceptual processes, but it 
is in the very nature of the case that 
these two aspects cannot be disentan- 
gled. Ina situation, on the other hand, 
in which the subject makes judgments 
of “same” or “different,” the attaching 
of responses need hardly be considered, 
as will be shown below. As for Post- 
man’s second criticism, one is inclined 
to agree that in the learning of mean- 
ingful signs and symbols a stress on the 


cal paired- 


cal reading of the man 
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differentiation of characteristics of the 
stimulus is generally beside the point. 
The question, however, is whether a 
theory of perceptual learning need, or in 
fact should, encompass this kind of task 
at all. The answer again hinges on the 
criterion for classifying a learning task 
as perceptual. 

It is thus the intent of this paper to 
Propose such a criterion, which will 
help us in evaluating the arguments 
brought forth in this controversy and, 
more important, enable us to undertake 
a more systematic analysis of the na- 
ture of perceptual learning in a variety 
of specific situations. 


A CRITERION FOR PERCEPTUAL 
LEARNING 


In formulating this criterion, our main 
aim will be to exclude from considera- 
tion situations which necessarily involve 
the formation of new stimulus-response 
associations, since in such situations 
the responses cannot be used to index 
changes on the perceptual side. Accord- 
ingly, the basic ingredient for our defi- 
nition of perceptual learning will be the 
use of responses which have previously 
been established to a class of stimuli or 
to a stimulus relationship; if the stimuli 
which are to be discriminated represent 
instances of this class or relationship, 
the response to it need not then be 
newly associated. Perceptual learning 
might thus be regarded as the devel- 
opment of a transfer of a previously 
learned set of responses to a new set of 
stimuli, the possibility for this transfer 
inhering in the physical characteristics 
of the stimuli. In other words, each 
stimulus to be discriminated must be 

classifiable, gua physical stimulus, into 
the class of stimuli to which the corre- 
sponding differential response is preat- 
tached, and similarly for stimulus rela- 
tionships. 

By this definition, then, paired-asso- 
ciate learning and similar tasks are left 


WoHLWILL 


outside the province of perceptual learn- 
ing situations. We will return later to 
the problem of assessing the role that 
perceptual learning may play in such 
tasks. For the moment it may be noted 
that this restriction likewise excludes 
the learning of signs and symbols, so 
that Postman’s second criticism, referred 
to above, is avoided. 

In a more positive sense, the criterion 
Proposed above has the advantage of 
confronting us with a clear-cut issue 
Concerning the nature of perceptual 
learning. An associational basis is no 
longer forced upon us; instead, two al- 
ternatives now emerge. On the one 
hand, the transfer which the situation 
provides for may take place in fact. In 
this case Perceptual learning may be in- 
terpreted in terms of Progressively more 
effective abstraction of those features of 
stimulation which Tepresent the basis 
for the transfer of the c 
On the other hand, this response may 
be established through association, di- 
rect or mediated, with any differentiat- 
ing aspect of the respective stimuli, 
through differential reinforcement. 

Although the first alternative leaves 
open the precise nature of the mecha- 
nism by which the transfer comes to be 
established, it entails a variety of ex- 
perimental consequences serving to dis- 
tinguish it from the second alternative, 
One of these may be stated at the out- 
set: if associational mechanisms are jn- 
volved, learning should take place only 
as a function of external reinforcement 
—i.e., the subject’s response must have 
some consequence informing him of its 
Correctness or aptness, But such rein- 
forcement need Not intervene in Jearn- 
ing which conforms to the first alterna- 
tive, since this js essentially a generali- 
zation phenomenon. Other implications 
of this distinction will be brought out 
below, in the discussion of specific ex- 
perimental problems, 


orrect response, 
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This brings us back to the interpreta- 
tion of perceptual learning offered by 
Gibson and Gibson (16). Their speci- 
ficity theory, based on stimulus differ- 
entiation instead of mediational proc- 
esses, may be classed with our first alter- 
native (perceptual learning as transfer, 
through abstraction of relevant aspects 
of the stimulus), while the “enrichment 
theory” to which they oppose themselves 
conforms to the second (perceptual 
learning through association). 

The above two formulations of per- 
ceptual learning may, then, be said to 
differ with respect to their conceptions 
of what is learned. As the Gibsons 
note in their reply to Postman (17), it 
is precisely this question which their 
specificity hypothesis is meant to clarify. 
In order to explore this question more 
specifically, however, it will be essential 
to examine a variety of particular situa- 
tions. Indeed, we shall see presently 
that the answer to this what-is-learned 
question depends to a considerable ex- 
tent on the particular discriminative 
judgment made by the subject. 


ANALYSIS OF PERCEPTUAL 
LEARNING SITUATIONS 


It is evident that, by the very na- 
ture of the criterion adopted, perceptual 
learning must be considered relative to 
a particular set of available responses. 
For the purposes of the following sur- 
vey, therefore, the various kinds of 
situations will be differentiated accord- 
ing to the type of discriminative re- 
sponse demanded of the subject. 

On this basis three broad categories 
of tasks may be distinguished: discrimi- 
nation of identity vs. difference, quanti- 
tative discriminations, and qualitative 
discriminations. Each of these last two 
can be further subdivided into a general 
and a specific type. Thus we have, with 
respect to the quantitative category, 
judgments of relationship and ordered 
magnitude judgments; and, with respect 


to the qualitative category, categoriza- 
tion and identification. 


Type I: Identity vs. Difference 


This type represents undoubtedly the 
most basic kind of discrimination; it 
is applicable to any stimulus material 
whatever, simple or complex, familiar or 
unfamiliar. For this reason it lends it- 
self ideally to the study of perceptual 
learning; yet the Gibsons, in their ex- 
ploratory experiment illustrating their 
specificity hypothesis (16), seem to be 
the only ones to have realized the pos- 
sibilities of this form of discriminative 
judgment for the investigation of per- 
ceptual learning. 

The study of this type of differentia- 
tion requires only the prior acquisition 
of the differential responses to identity 
and difference in a group of stimuli.* 
These responses are, furthermore, com- 
pletely nonspecific with respect to the 
aspects of the stimuli to which they 
refer: any differentiating characteristic 
among the set of stimuli constitutes a 
potential basis for the response. The 
abstraction of such a characteristic is 
therefore a necessary and sufficient con- 
dition for the transfer of the response, 
so that intervening associative processes 
may be considered beside the point. For 
this type of learning, in fact, reinforce- 
ment may be altogether unnecessary, as 
was the case in the experiment by the 
Gibsons just cited. In a sense their hy- 
pothesis is thus true here by definition: 
in this situation perceptual learning nec- 
essarily consists in making finer dis- 
criminations among the sources of varia- 
tion present in the stimuli. 


2For nonverbal subjects, these responses 
could be established through preliminary train- 
ing by the matching-from-sample or oddity 
techniques, directed at the formation of learn- 
ing sets. The success which monkeys (cf. 19) 
and even birds (24, 27) have had in solving 
problems of this type argues itself for the 
rudimentary nature of this form of discrimi- 


nation. 
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It is possible, however, to interpret 
this hypothesis in a more specific sense, 
i.e., as postulating discrimination along 
continuous dimensions of variation, as 
opposed to the isolation of discrete cues. 
Thus, in learning to differentiate a set 
of faces, does the observer pick out in- 
dividual details (a crooked nose, a 
double chin, etc.), or does he discrimi- 
nate dimensional differences (width of 
mouth, ratio of distance between eyes 
to that between forehead and chin, 
etc.)? The importance of this question 
lies in the fact that the generalizability 
of the learning is dependent on which 
of these two forms of learning occurs. 
If the discrimination is dimensional, the 
learning should transfer to a new sam- 
ple of faces, whereas little or no trans- 
fer is to be expected from the isolation 
of specific cues. 

It appears likely that the complexity 
of the stimuli will determine in large 
measure the extent to which dimensional 
aspects of the stimuli will be discrimi- 
nated. Thus the nonsense shapes gen- 
erated according to information-theory 
principles (6) suggest themselves as 


promising materials for the study of this 
question. 


Type II: Judgments of Quantitative Re- 
lationships 


Here we find represented the work 
on the reduction of simple differential 
thresholds through practice, which ac- 
counts for a considerable portion of the 
perceptual learning literature (cf. the 
review by E. Gibson [10]). But this 
category includes in addition a variety 
of more complex problems, such as 
the perceptual constancies, illusions, etc. 
The essential element is the presenta- 
tion of stimuli varying on some physi- 
cal dimension, in accordance with one 
of the classical psychophysical methods 
(other than that of single stimuli), or 

some other procedure involving com- 
parative judgment. An additional re- 
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quirement, following from our definition 
of perceptual learning, is that the sub- 
jects have previously applied the re- 
sponse categories to the dimension 
which is manipulated. Although this 
stipulation rarely presents a problem 
with verbal subjects, it does create a 
judgment situation somewhat different 
from that in Type I. 

In order to illustrate the character- 
istic features of this type of judgment, 
let us consider a subject who has been 
asked to judge the relative distance of 
two objects. The first point to be noted 
is that, in contrast to Type I situations, 
the differential responses refer now spe- 
cifically to a particular physical dimen- 
sion. But in the subject’s prior experi- 
ence with this dimension, a complex of 
sensory information, both dimensional 
(e.g., linear Perspective, gradients of 
texture) and nondimensional (eg., in- 
terposition), has necessarily been cor- 
related with the physical dimension of 
distance. One might say, in fact, that 
the concept of distance, connotatively 
considered, is linked to this complex of 
stimulation. 

Accordingly, in order to show im- 
provement due to practice in his dis- 
tance judgments, our adult subject need 
only scan the stimulus field more effec- 
tively, so as to abstract some of this 
correlated information. This view would 
seem to represent the Position taken by 
the Gibsons on the problem of per- 
ceptual learning, Essentially descriptive 
though it remains, such an interpreta- 
tion may yet be strengthened against 
the charge of circularity, to the extent 


that the experimenter succeeds in speci- 
fying the particula 


1 r variables of stimu- 
lation which are being differentiated in 
this situation. The exploration of such 
variables is of course the aim of the 
“global psychophysics” advocated by 
Gibson (15), 

_ Are other explanations for the learn- 
ing observed in this situation possible? 


C 


DEFINITION AND ANALYSIS OF PERCEPTUAL LEARNING 287 


If this learning took place in the ab- 
sence of external reinforcement, it is 
difficult to conceive what they might be, 
except perhaps for general warm-up or 
learning-to-learn effects. The associa- 
tional alternative, in any case, would 
scarcely be applicable, since the forma- 
tion of S-R associations would seem to 
presuppose some reinforcing agent. 

If, on the other hand, reinforcement 
is given, the associational hypothesis 
clearly becomes relevant. Learning may 
consist, then, in a direct association of 
the correct responses with some arbi- 
trary differentiating cues in the stimulus 
objects. One readily testable implica- 
tion of this formulation may be pointed 
out: it should be possible to establish 
increasingly nonveridical judgments as 
easily as increasingly veridical ones, 
simply by reversing the pattern of re- 
inforcements. Any difference in favor 
of veridical learning would support the 
first explanation, in terms of direct 
transfer through improved differentia- 
tion of relevant stimulus variables. 

Judgments of distance represent, of 
course, only one kind of task, and a 
relatively complex one, among the many 
different problems included under Type 
II. It is relevant to ask, therefore, 
whether the above discussion is appli- 
cable generally to other problems of this 
type. The answer depends on the com- 
plexity of the stimulus correlates of the 
physical dimension selected, in the con- 
text of the situation in which the judg- 
ment is to be made. Where these cor- 
relates are as univariate, as dimension- 
ally simple as the physical dimension 
itself (e.g., the loudness of pure tones 
in a noiseless chamber, the brightness 
of flat surfaces under constant illumina- 
tion), there appears to be no opportu- 
nity for the kind of perceptual learning 
postulated by the specificity hypothesis. 
It is perhaps significant, in this connec- 
tion, that in her extensive review of the 
perceptual learning literature (10), E. 

Burges 


Gibson cites no instance of improvement 
in differential thresholds for these in- 
tensitive dimensions. For most sup- 
posedly simple stimulus dimensions, 
however, the sensory input is of a some- 
what higher order of complexity. Par- 
ticularly is this true in the case of 
vision, where there are almost always 
some relationships within the stimulus 
field that vary concomitantly with the 
dimension being manipulated. For ex- 
ample, in as relatively bare a stimulus 
field as that represented by the lines for 
the horizontal-vertical illusion, variation 
of the vertical will be accompanied by 
changes in the (virtual) triangle formed 
by the vertical, the horizontal, and the 
line joining the extremities of these two. 
As for the still more complex, multi- 
variate stimulus fields generally en- 
countered outside the laboratory, it is 
apparent that the opportunity for the 
utilization of a variety of correlated in- 
formation in the proximal stimulus is 
often considerable—a point which the 
Gibsons are at pains to emphasize (16). 


Type III: Ordered Magnitude Judg- 
ments 


If, for the relational responses of 
Type II tasks, we substitute an ordered 
set of responses serving to identify the 
magnitude of a corresponding set of 
points on a physical scale, we arrive at 
a Type III task, commonly known as 
scale learning. Two aspects of scale 
learning may be distinguished: the ab- 
solute and the relative. It will become 
apparent that, when only the second of 
these aspects is considered, the general 
nature of the problem resembles in 
many ways that of a Type II task. But 
since scale learning has typically been 
studied in regard to its absolute aspect, 
let us start by examining it. 

Scale learning in the absolute sense 
consists in the identification of the 
points on the stimulus scale in terms of 
their absolute physical magnitude. The 
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difficulty with this aspect, from the 
standpoint of the present approach to 
perceptual learning, is that one can 
rarely be confident that the differential 
responses have been learned beforehand. 
Thus one would first have to establish 
them through training, as E. Gibson 
and her associates have done for judg- 
ments of distance over the ground (11, 
12). It was demonstrated in these 
studies that this training transfers to 
subsequent judgments of a new set of 
points on the same field (11), and even 
to points on a different terrain (12). 
This evidence is, however, of equivocal 
relevance to the question of perceptual 
learning, since the training consisted in 
the establishment of the responses, and 
there was no opportunity for learning 
during the transfer series itself. Gibson 
and Bergman (11), as a matter of fact, 
recognize the ambiguity of their results, 
for they point out that the transfer they 
found may reflect chiefly the improved 
conceptualization of the stimulus scale, 
through a more accurate notion of the 
unit of measurement. 

An alternative possibility, discussed in 
part by these authors, appears of greater 
interest for the study of perceptual 
learning: the subjects may have become 
more accurate in their discrimination of 
the relative ordering of the points, and 
of the relative distances between them. 
This brings us to the relative aspect of 
scale learning, which, although it has 
not been explicitly studied thus far, 
lends itself much more readily to an 
analysis in conformance with our defi- 
nition of perceptual learning. 

For this aspect requires only that the 
subject have learned a set of responses 
conforming to the ordinal, interval, or 
ratio character of the stimulus scale. 
He would then be in a position to ap- 

ply this response set to the perceived 
ordering of the points on the stimulus 
scale, and, where applicable, to the per- 
ceived metric of the scale itself. We 
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would thus be interested primarily in 
the regression, linear or otherwise, of 
the judgments on the objective values, 
and the problem of perceptual learning 
would reduce to the approach of this 
regression to unity. 

The above approach rests, of course, 
on the assumption that the subject’s 
numerical responses reflect the metric 
character of the perceived magnitudes. 
Garner, Hake, and Eriksen (9) have 
questioned the validity of this assump- 
tion, pointing out that the subjects may 
use the numbers, not in terms of their 
arithmetic relationships, but rather as 
identifying responses, or perhaps as a 
set of ordered responses. The former 
would be true particularly where the 
subject has a record of prior associative 
experience in the application of num- 
bers to different magnitudes on a di- 
mension, as in the case of size, where 
particular values are frequently associ- 
ated with individual objects. 

This is certainly a valid criticism, but 
it may not be critical in the study of 
perceptual learning. For the question 
remains how to account for the improve- 
ment in a set of judgments in a learning 
task. It may be, of course, that the op- 
portunity for the association of num- 
bers with individual points on the scale 
is provided by the learning situation it- 
self. An example of this kind may be 
seen in an earlier study of distance esti- 
mation by Gibson and Smith (13), in 
which it was observed that the subjects 
merely learned to associate specific, iso- 
lated cues on the terrain corresponding 
to a given distance with the verbal re- 
sponse which was reinforced for that 
distance. But it should be possible to 
spot learning of this direct associative 
character by means of transfer tests in- 
volving other points on the scale, or a 
change in context, or discrimination on 
other, correlated dimensions. (In this 
particular study, the authors were in- 
terested in transfer to size-at-a-distance 
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judgments. Not surprisingly under the 
circumstances, none was found.) If, in 
terms of the relative aspect of the scale, 
such transfer is found, or if learning oc- 
curs without benefit of reinforcement, it 
would seem to justify the inference that 
the subjects actually did learn to per- 
ceive more accurately the relative mag- 
nitude of the points on the scale.* This 
learning is then presumably attributable 
to improved discrimination of relevant 
stimulus variables, so that the problem 
of the nature of perceptual learning be- 
comes essentially equivalent to that dis- 
cussed under Type II situations. 

Since the discussion so far has cen- 
tered almost exclusively on the scale of 
distance over the ground, it is impor- 
tant to note that this scale is not only 
particularly rich in associated variables 
of stimulation, as noted in the previous 
section, but represents a rather special 
case, insofar as all of the points on the 
scale are continually exposed to the sub- 
ject’s view. This property clearly con- 
tributes to the subject’s chances of ef- 
fectively ordering a set of designated 
distances. For other visual scales (area, 
brightness) one might similarly expose 
the scale in its entirety, but for dimen- 
sions on which only one or a few mag- 
nitudes at a time can be exposed (pitch, 
weight), the situation is probably not 
comparable. At the very least it would 
be essential with such scales to present 
anchoring stimuli, identified as such, at 
frequent intervals during the learning 
task. Otherwise the subject’s response 
scale might easily “drift,” thereby ren- 
dering the results quite equivocal with 
respect to the relative aspect of the 
scale judgments. 


8 It is conceivable, to be sure, that the learn- 
ing may reflect the gradual abandonment by 
the subject of preferential response habits, €g» 
the tendency to respond in round numbers. 
This reservation points to the importance in 
perceptual learning experiments of eliminating 
differential response strengths in advance; cf. 
the discussion of Type V tasks below. 
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Type IV: Categorization 


This type of perceptual learning is 
concerned with the placing of unfamiliar 
stimuli into a set of nondimensional 
categories previously learned by the sub- 
ject. The differential responses are thus 
the concepts representing the categories; 
they are preassociated to a class of 
stimuli sharing certain physical charac- 
teristics, which are present in less readily 
discriminable form in the stimuli to be 
categorized. 

This problem apparently still awaits 
experimental attack, but one way of ap- 
proaching it might be as follows: sets 
of nonsense shapes varying around a 
prototype—as described in the recent 
experiment by Attneave (5)—would be 
selected to establish an artificial set of 
categories; other shapes constructed so 
as to be further removed from their re- 
spective prototypes would then be pre- 
sented for categorization. The question 
here is whether the subjects would, with 
practice, succeed in making the neces- 
sary discriminations along the variables 
on which the categories could be dif- 
ferentiated, or whether the categoriza- 
tion of the new stimuli would proceed 
by direct association, through reinforce- 
ment. This problem is clearly relevant 
to the study of concept formation: does 
a child, in learning to classify a duck 
as a bird, actually discriminate the fea- 
tures of the stimulus object which re- 
late it to other birds with which the 
child is already familiar, or does he 
merely associate the term “bird” with 
this new stimulus? The evidence on 
this kind of question is lacking, how- 
ever. To be sure, a number of concept- 
formation studies have manipulated vari- 
ables relating to the stimulus (e-8., i 
26), but the interest has been limited 
to the effect of such variables on the 
speed of acquisition of the concepts, 
rather than on the kind of discrimina- 
tion of the stimuli achieved. 
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Type V: Identification 


The identification of stimuli by re- 
sponses specific to them is one of the 
more frequently studied types of tasks 
in the investigation of perceptual learn- 
ing. Yet, according to our criterion, 
this very specificity of the responses 
creates an apparent problem: if the re- 
sponses are pre-established to the spe- 
cific stimuli which are to be discrimi- 
nated, these cannot, in themselves, be 
unfamiliar. But how can there be any 
room for learning, then? Indeed, the 
only way in which learning can enter 
into such a situation at all is through 
the presentation of familiar stimuli un- 
der conditions which make recognition 
difficult. But under these circumstances 
the problem of perceptual learning be- 
comes a rather different one, involving 
factors which relate to the general per- 
ceptual conditions, or else to the identi- 
fying responses, rather than to the par- 
ticular stimuli presented. 

This point is well illustrated in some 
of the studies employing methods of 
“impoverished stimulation” to make 
recognition difficult (through exposure 
of stimuli tachistoscopically, in the pe- 
riphery of the eye, under dim illumina- 
tion, over a background of noise, etc.— 
cf. the review by E. Gibson [10]). 
Thus, in the study by Howes and Solo- 
mon (21) on the recognition of words 
in the tachistoscope, the words in the 
second half of the list were identified 
more readily than those in the first half, 
although the words in the two halves 
were not the same. This finding re- 
flects thus a generalized practice effect, 
attributable to mechanisms of a pe- 
ripheral nature perhaps, such as recep- 
tor adaptation or changes in focusing re- 

sponses. 

A factor of a rather different sort, re- 
lating to the response system, may also 
manifest itself in this type of situation. 
Where a considerable number of re- 
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sponse alternatives are available to the 
subject, these may not be equal in 
strength initially. Learning may then 
consist primarily in the gradual elimi- 
nation of these preferential response 
habits. This appears to be the explana- 
tion, for instance, for at least part of 
the learning which occurred in Seward’s 
study of letter recognition (31), in 
which subjects tended to limit their re- 
sponses to but a portion of the alphabet 
at the beginning. 

While one may devise ways of con- 
trolling for these extraneous factors 
(e.g., response strength can be equated 
by making all required responses avail- 
able to the subject throughout the task), 
it is doubtful whether studies employing 
impoverished stimulation methods will 
yield much information of value to the 
study of perceptual learning, There are, 
however, a variety of other ways of 
making recognition difficult. For in- 
stance, the usual brightness relation- 
ships in the stimulus may be reversed, 
by the use of photographic negatives. 
Alternatively, objects may be presented 
from unfamiliar angles or in unfamiliar 
contexts, and, more particularly, in mo- 
tion (cf. 14). If this kind of task gives 
rise to a learning process, it may be of 
considerable interest in its own right for 
the study of perceptual learning. But 
it is apparent that learning in a situa- 
tion of this sort is not easily handled 
either in terms of the differentiation of 
variables of stimulation or in terms of 
associative processes. This type of task 
has, however, been investigated much 
too little to justify speculation as to the 
nature of possible learning effects. 


THE PERCEPTUAL ASPECT OF PAIRED- 
ASSOCIATE LEARNING 


In view of the important role often 
played by perceptual processes in the 
association of new responses to a set of 
stimuli, a criterion for the investigation 
of perceptual learning which seemingly 
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eliminates this problem from considera- 
tion may well be objected to as unduly 
restrictive. But it has been argued here 
only that it is not useful to study the 
paired-associate task itself as a percep- 
tual learning situation, due to the con- 
founding of perceptual and response 
variables. One may still apply the ana- 
lytic framework developed in this paper 
to this problem, by examining the re- 
sults of such a learning process with re- 
spect to the differentiation of stimuli 
that has taken place. This procedure 
may help, in fact, in clarifying some 
issues in the study of paired-associate 
learning. 

Consider the work on the problem of 
stimulus predifferentiation, recently re- 
viewed by Arnoult (4). This problem 
is concerned with the generally positive 
transfer effects obtained by establishing 
verbal responses to a set of stimuli, prior 
to the association of a different set of 
responses (typically motor) to the same 
stimuli. As Arnoult points out in his 
review, the nature of this phenomenon 
remains obscure, and there are at pres- 
ent not even sufficient grounds for choos- 
ing between a stimulus-differentiation in- 
terpretation and other alternative ex- 
planations which have been proposed to 
account for it. It seems plausible to 
relate this equivocal conclusion to the 
ambiguity inherent in the situation it- 
self, with respect to the role of percep- 
tual processes. 

This role could be studied more di- 
rectly by testing for the differentiation 
of the given set of stimuli on one or 
more relevant perceptual dimensions, be- 
fore and after the association task. One 
would then be able to decide what has 
been learned about the stimuli during 
the course of the paired-associate learn- 
ing, and thus to correlate the extent and 
kind of differentiation achieved with the 
learning in the transfer task. 

The design just proposed actually re- 
sembles that used in the studies by 
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Arnoult (3) and Hake and Eriksen 
(18). Parts of these two studies con- 
cerned the effect of the association of 
letters of the alphabet to sets of non- 
sense shapes on the subsequent recog- 
nition of these shapes on a same-differ- 
ent basis. But this particular basis is 
not very revealing, since the mastery of 
the association task already implies that 
the stimuli have been differentiated from 
one another. The fact that in both 
studies the results were actually nega- 
tive may thus seem rather puzzling— 
indeed, they have been cited as negative 
evidence on the question of stimulus 
predifferentiation. The paradox disap- 
pears, however, when we realize that in 
the recognition task of both studies new, 
arbitrarily chosen stimuli were intro- 
duced along with the old ones. Thus 
in order to expect transfer in this situa- 
tion one would have to assume that the 
subjects had learned more about the 
training stimuli than was in fact re- 
quired to differentiate them. 


“Extrinsic” vs. “INTRINSIC” CUES 
IN PERCEPTUAL LEARNING 


The approach to the problem of per- 
ceptual learning presented in this paper 
is intended chiefly for the analysis of re- 
stricted experimental situations, involv- 
ing perceptual judgments under con- 
trolled practice. Yet some of the ma- 
jor issues in perceptual learning refer to 
the process by which the individual, dur- 
ing the course of his development, comes 
to perceive the world as he does. In 
fact, the contrast between “enrichment” 
and “differentiation” processes deline- 
ated by Gibson and Gibson appears to 
be directed at conceptions of perceptual 
development in this more general sense. 

But since the acquisition of the re- 
sponses to stimuli must form an integral 
part of the process of development, it 
becomes difficult in this case to isolate 
the perceptual aspect of what is termed 
“perceptual learning.” Thus, insofar as 
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this long-term learning is concerned, the 
Gibsons’ opposition between enrichment 
and differentiation may not be entirely 
warranted. There are undoubtedly two 
mutually dependent processes going on 
simultaneously: increasing differentia- 
tion of stimuli on the one hand, and 
development of intervening structures, 
such as proposed in “enrichment” theo- 
ries to mediate the resulting discrimi- 
nations, on the other hand. 

Yet in one respect it seems possible 
to formulate a fairly sharp difference 
between the major empiricist interpreta- 
tions of perceptual learning and the 
Gibsons’ specificity theory. This differ- 
ence, which parallels closely that pro- 
posed earlier between direct transfer and 
response association as bases for per- 
ceptual learning, concerns the type of 
features of the stimulus which the in- 
dividual learns to discriminate. 

One may distinguish, in effect, be- 

tween those aspects of the stimulus 
which are extrinsic and those which are 
intrinsic to a particular judgment. The 
Gibsons are clearly concerned with the 
intrinsic stimulus correlates of percep- 
tion—intrinsic in the sense that these 
aspects of the proximal stimulus in- 
variably reflect the particular aspect of 
the physical environment which is be- 
ing judged, by the very nature of the 
sensory apparatus and of the physical 
energies impinging on it. For example, 
the edges of a road seen from differ- 
ent heights will necessarily converge to- 
wards the horizon at differing angles 
(cf. 15, p. 140). Thus the angle of 
convergence constitutes an intrinsic cor- 
relate of perceived height. These cor- 
relates are therefore independent of ex- 
perience with specific objects or in spe- 
cific environments. 

In contrast, the functionalists, as rep- 
resented by the transactional school 
(22) or by Brunswik’s probabilistic 
functionalism (8), do not differentiate 
between such intrinsic information and 
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that conveyed by objects or environ- 
mental features of a more extrinsic type 
with which the subject may have come 
into contact. Thus, for Brunswik, per- 
ception is a matter of the utilization of 
cues of varying degrees of “ecological 
validity,” whatever their origin. Simi- 
larly, the assumptions of the transac- 
tionalists are based in part on experi- 
ence with particular meaningful objects 
(e.g., the typical sizes of ping-pong and 
billiard balls [20]) and in part on ex- 
perience with universal aspects of stimu- 
lation (e.g., the successive stimulation 
of adjacent points on the retina due to 
a moving object [34]).4 

The question which emerges from 
these two Opposing viewpoints is the 
following: are extrinsic and intrinsic 
cues functionally equivalent, or are 
there differences, quantitative or quali- 
tative, in the roles played by these two 
kinds of information? With respect to 
the basic stimulus dimensions of the 
physical world, at least, such differences 
are surely not altogether implausible, It 
seems conceivable, at any rate, that the 
nature of the perceptual mechanism of 
the organism is such as to predispose 
him automatically to respond to infor- 
mation of the intrinsic type, on account 
of its very reliability, 

In order to answer this question, the 
two types of cues must be put into con- 
flict with each other, as in Slack’s (32) 
study of the size estimation of normal, 
over-, and undersized chairs in the pres- 


ence of the ordinary cues to distance. It 


= In Ames’ (2) rotating trapezoidal window 
el 


ect, both of these kinds of cues are invoked. 
There is, on the one hand, experience with 
windows and rectangular objects, and on the 
other hand, experience with perspective trans- 
formations in general, It is of interest that 
Allport and Pettigrew (1) have recently 
thrown doubt on the former of these experi- 
ential factors by showing that amount of ex- 
perience with windows and rectangular forms 


is to a large extent unrelated to the percep- 
tion of this illusion, 


f 
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was found that the judgments were dis- 
placed somewhat towards the assumed 
size of a chair, although the true sizes 
of the stimuli accounted for the major 
portion of the variance in these judg- 
ments by far. But this study tells us 
little about the actual learning processes 
that may be reflected here, as we are 
only shown its end result.’ 

A more direct line of attack on this 
problem would be to juxtapose extrinsic 
and intrinsic cues in a perceptual learn- 
ing experiment. Thorndike, in explora- 
tory and rather sketchily documented 
work (33, pp. 77 ff.), has in effect em- 
ployed this method, demonstrating, for 
instance, that through arbitrary associa- 
tion with an irrelevant cue (shape), non- 
veridical judgments of area could be in- 
duced. The relative importance of the 
two types of information could similarly 
be assessed by correlating them in a 
perceptual learning task, and rendering 
the extraneous cue irrelevant, or even 
reversing it, once the discrimination 
had been established. Of perhaps even 
greater interest might be the introduc- 
tion into a perceptual learning experi- 
ment of a readily discriminated extrane- 
ous cue under probabilistic reinforce- 
ment. Subsequent tests with this cue 
removed would indicate to what extent 
the subject relies on highly visible 
though relatively unreliable cues, in 


5 This shortcoming applies more generally to 
the experimental work emanating from the 
empiricist position. Apart from some animal 
studies, few learning experiments have been 
carried out by these psychologists, as Gibson 
and Gibson (16) point out. Apparent excep- 
tions to this statement are the recent studies 
by Kilpatrick (23) and Weiner (35) on learn- 
ing in the distorted room, and—representing 4 
somewhat different position—by Kohler (25) 
on adaptation to the wearing of reversing and 
distorting lenses. But these are really unlearn- 
ing experiments, in which firmly established 
responses. require modification due to altered 
stimulus conditions. ‘They can thus not be 
assumed to represent the process of perceptual 
learning as it develops originally. 


preference to possibly quite subtle but 
perfectly reliable sensory information. 
This kind of investigation might thus 
be expected to throw light on the rela- 
tive merits not only of the enrichment 
and differentiation hypotheses of per- 
ceptual learning but also, by extension, 
of the psychophysical as compared to 
the functional approaches to perception 
generally. 

Lastly, it may be noted that there is 
a definite connection between the notion 
of intrinsic and extrinsic cues and the 
two alternative interpretations of per- 
ceptual learning developed on the basis 
of our criterion. Once a response to a 
dimension has been established, the pos- 
sibility for learning on a stimulus-trans- 
fer basis, without reinforcement, is de- 
pendent on the presence of intrinsic cues 
which are initially imperfectly differenti- 
ated. As for the associative type of 
learning, it is clearly based on the uti- 
lization of extrinsic cues. The postula- 
tion of these two types of cues appears, 
however, to be relevant mainly to dis- 
criminations along physical dimensions, 
while the distinction between the two 
varieties of perceptual learning may be 
applied more generally, e.g., to cate- 
gorization learning (cf. the discussion 
of Type IV situations, above). 


SUMMARY AND CONCLUSIONS 


The primary aim of this paper has 
been to suggest a criterion for percep- 
tual learning which will allow a concep- 
tual distinction between learning based 
on perceptual functions and that based 
on response association. The approach 
suggested is to regard perceptual learn- 
ing as the development of a generaliza- 
tion of a previously established differ- 
ential response to a new stimulus. Thus 
defined, perceptual learning does not re- 
quire the formation of new S-R _a8so- 
ciations and, in contrast to associative 
learning, may occur in the absence of 
external reinforcement. 
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An examination of five types of judg- 
ments in the light of this definition 
brought out a number of specific test- 
able problems with regard to the nature 
of perceptual learning, but the evidence 
concerning them is for the most part 
either unavailable or inadequate on 
methodological grounds. In several in- 
stances, the relevance of the “specificity 
hypothesis” advanced by Gibson and 
Gibson (16) was noted. This hypothe- 
sis, which describes “what is learned” in 
perceptual learning as an increased dif- 
ferentiation of variables of stimulation, 
may be of particular predictive value 
where the aspects of the stimulus which 
will be differentiated can be specified. 
Finally, an experimental approach to 
the Gibsons’ opposition between “en- 
richment” and “differentiation” theories 
of perceptual learning was suggested, 
through the opposition of intrinsic and 
extrinsic cues in the learning of a dis- 
criminative response. 

This paper, like that of the Gib- 
sons, has sidestepped the question of 
the processes and mechanisms within 
. the organism which mediate perceptual 
learning of the differentiation variety. 
Admittedly, from a theoretical point of 
view, the problem of perceptual learn- 
ing is thus only half solved at best. The 
task remains to analyze the processes 
involved in terms of a set of constructs 
which will permit these phenomena to 
be integrated into a more general per- 
ceptual theory. It is possible that some 
constructs proposed in other contexts— 
e.g., Pitts and McCulloch’s (29) “scan- 
ning mechanisms,” or Piaget’s (28) 
“decentrations”—may be adapted to ac- 
count for improvement in the differ- 
entiation of stimuli. At the present, 
however, they remain largely specula- 
tive and of unknown value in the han- 
dling of perceptual learning phenomena. 
In conclusion, it may be suggested 

that an analysis of the stimulus, aimed 
at specifying what aspects are discrimi- 
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nated as a result of learning, may itself 
contribute heavily to the eventual con- 
struction of adequate mediating mecha- 
nisms to explain perceptual learning. 
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THE CONCEPT OF IDENTIFICATION ! 
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Several years ago Sanford (20) pre- 
sented an analysis of the concept of 
identification. In brief, Sanford sug- 
gested that the term be applied to situa- 
tions in which “an individual may be 
observed to respond to the behavior of 
other people or objects by initiating in 
fantasy or reality the same behavior 
himself . . . the individual strives to 
behave in a way that is exactly like 
that of the object” (20, p. 109). San- 
ford further suggested that the motive 
for this imitative behavior was a threat 
to the person’s self esteem. By limit- 
ing the term “identification” to those 
imitative behavioral sequences in which 
the motivation for the act was anxiety 
over self esteem, Sanford emphasized 
two points: (a) mere similarity in overt 
behavior between a subject and a model 
was not necessarily a measure of identi- 
fication, and (b) the motive for the 
imitative behavior was one of the de- 
fining characteristics of an identificatory 
response. 

The various behavioral phenomena 
which have been labeled “identification” 
differ in their manifest properties and 
motivations. The following four classes 
of behavior have been described as re- 
lated to the process of identification be- 
cause they all can lead to similarities in 
behavior between a subject and a model. 


1 This research was supported, in part, by a 
research grant (M-1260) from the National 
Institute of Mental Health of the National 
Institutes of Health, United States Public 
Health Service. The views of Wesley Allin- 
smith, Vaughn J. Crandall, Leonard M. Lan- 
sky, and Howard A. Moss are especially ac- 
knowledged. A major stimulus for the pres- 
ent essay was a workshop in parent-child 
relations supported by USPHS Grant 1649 
and held at the Merrill-Palmer School, De- 
troit, Michigan, July 14-27, 1957, 


Imitation Learning 


This term refers to the initiation and 
practice of certain responses (gestures, 
attitudes, speech patterns, dress, etc.) 
which are not subject to prohibition by 
the social environment and which are 
assumed to be the result of an attempt 
to imitate a model. The behavior has 
been labeled either “matched-dependent 
behavior” or “copying” by Miller and 
Dollard (17). Miler and Dollard posit 
that initially the imitative act occurs by 
chance and the act can only be rein- 
forced if some drive is reduced follow- 
ing the execution of the response, Ac- 
cording to this view only direct reward 
from the social environment, like praise 
or affection, can strengthen the person’s 
tendency to imitate a model. Mowrer 
(18) distinguishes between developmen- 
tal and defensive identification. In the 
former process, the person imitates or 
reproduces the behavior of a model in 
order to “reproduce bits of the beloved 
and longed-for parent” (18, p. 615). 
Mowrer suggests that most imitation of 
a model is the result of the desire to re- 
produce responses which have acquired 
secondary reward value through asso- 
ciation with a nurturant and affectionate 
model. Thus, Mowrer emphasizes the 
self-rewarding aspect of certain imita- 
tive acts as opposed to Miller and Dol- 


lard’s emphasis on direct reward from 
the social environment, 


Prohibition Learning 


This term refers to the adoption and 
practice of the prohibitions of the par- 
ents and parent substitutes. The ac- 
quisition of these prohibitions bears 
some relation to the process of super” 
ego development as described by PSY 
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choanalytic theory (2, 3, 4, 11). Sev- 
eral investigators have suggested that a 
major motivation for the acquisition of 
some prohibition is anxiety over antici- 
pated loss of love (10, 11, 18, 20, 23). 
Sanford labeled this process “introjec- 
tion” and suggested that the learning 
and maintenance of this class of behav- 
ior might be explained without use of 
the concept of identification. 


Identification with the Aggressor 


This phrase refers to the adoption of 
behaviors which are similar to those of 
an aggressive or threatening model. The 
motivation for this “imitation” is as- 
sumed to be anxiety over anticipated 
aggression or domination by the threat- 
ening model. It is difficult to explain 
this behavior as a product of either 
prohibition or imitation learning, since 
the motive and reinforcement do not 
seem related to anxiety over anticipated 
loss of love or desire for a direct, social 
reward like praise or affection. Anna 
Freud (2) has labeled this phenome- 
non “identification with the aggressor,” 
Mowrer has called this process ‘“defen- 
sive identification” (as distinct from de- 
velopmental identification), and Sanford 
has suggested that the term “identifica- 
tion proper” be restricted to this class 
of behavior. 


Vicarious Affective Experience 


This phrase refers to the experience 
of positive or negative affect on the part 
of a person as a result of an event which 
has occurred to a model. Salient ex- 
amples of this phenomenon are (a) a 
child’s elation or depression at learning 
that his parent is a success or failure, 
or (b) a mother’s elation following the 
success of her child in school. This 
phenomenon of vicarious, affective ex- 
perience has been attributed to a per- 
son’s identification with a model, but 
this affective response has been difficult 
to explain and often neglected by psy- 


chologists investigating the identifica- 
tion process. These four phenomena 
(imitation learning, prohibition learn- 
ing, identification with the aggressor, 
and vicarious, affective experience) ap- 
pear to be mediated by different motives 
and rewards, and an analysis of each of 
them is one purpose of this paper.” 

In different contexts, social scientists 
have used the term “identification” to 
refer to three different sets of variables: 
(a) the process of identification; (b) in- 
dividual differences in the content of the 
behaviors, motives, and attitudes ac- 
quired as a result of the identification 
process; and (c) the differential effect 
of various models that are used during 
the identification process (3, 4, 5, 7, 9, 
11, 13, 15, 16, 25, 26). This paper 
recognizes the relevance of the model 
and content dimensions but is primarily 
concerned with the process of identifi- 
cation, and will attempt to analyze this 
process in behavioral terms. It is sug- 
gested that the process remains the same 
regardless of the models used or the spe- 
cific behavioral content that is acquired 
as a result of an identification. 


Definitions of Identification 


The concept of identification origi- 
nated in psychoanalytic theory, and 
Freud made a distinction between pri- 
mary and secondary identification (3, 4, 
5). Primary identification referred to 
the initial, undifferentiated perception of 
the infant in which an external object 
was perceived as part of the self, while 
secondary identification began after the 
child had discriminated a world of ob- 


2%n an unpublished paper presented at a 
symposium at Harvard University in 1957, 
Bronfrenbrenner described three types of 
identification: (a) anaclitic identification, (b) 
identification with a source of power, and (c) 
identification through reinforcement of a role 
model. These three terms are similar in mean- 
ing to the present phrases of prohibition learn- 
ing, identification with the aggressor, and 
jmitation learning, respectively. 
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jects separate from the self. Freud im- 
plied in his later writings that the proc- 
ess of secondary identification was mo- 
tivated primarily by the motives and 
anxieties created by the oedipal situa- 
tion. In order to reduce the anxiety 
over anticipated aggression or rejection 
from the same-sex parent and obtain 
vicariously the affection of the opposite- 
sex parent, the child identified with the 
former. Identification was described by 
Freud as “the endeavor to mould a per- 
son’s own ego after the fashion of one 
that has been taken as a model” (5, p. 
63). 

Mowrer’s concept of “defensive identi- 
fication,” Sanford’s definition of “identi- 
fication proper,’ and Anna Freud’s de- 
scription of “identification with the ag- 
gressor” are all related to the earlier 
psychoanalytic hypothesis that the threat 
value of the same-sex parent motivated 
the child to identify with him in order 
to reduce the anxiety associated with 
this threat. However, it is suggested 
that an individual may identify with a 
model not only to reduce anxiety over 
anticipated aggression from a model but 
also to experience or obtain positive 
goal states which he perceives that the 
model commands. The thesis of this 
paper is that the motivation to com- 
mand or experience desired goal states 
of a model is salient in the development 
and maintenance of an identification. 
It will be suggested later that two ma- 
jor goal states involved in identification 
behavior are (a) mastery of the envi- 
ronment and (b) love and affection. 
However, it is not implied that these 


3 Tt is assumed that anticipation of a posi- 
tive goal state is associated with the anticipa- 
tion of a change in affect, and thus the phrase 
“experience goal states of the model” will be 
used synonymously with the phrase “experi- 
ence affective states of the model.” This as- 
sumption agrees with McClelland’s definition 
of a motive as an “anticipation of a change in 
affective state” (14, p. 466). 
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are the only goals which an individual 
desires to command. 


Definition 


Identification is defined as an ac- 
quired, cognitive response within a per- 
son (S). The content of this response 
is that some of the attributes, motives, 
characteristics, and affective states of a 
model (M) are part of S’s psychologi- 
cal organization. The major implica- 
tion of this definition is that the S may 
react to events occurring to M as if 
they occurred to him. 


The Acquisition and Maintenance of an 
Identification 

Although identification has been de- 
fined as a cognitive response, it is not 
implied that the content of the response 
is available to consciousness or easily 
verbalized. Thus the terms “cognitive 
response,” “belief,” “wish,” or “assump- 
tion” will be used in this text to include 
cognitive processes not always available 
through verbal report. Identification is 
not viewed as an all-or-none process, 
Identification is a response that can 
vary in strength and there will be dif- 
ferences in the degree to which an S 
believes that the characteristics of a 
model, whether assets or liabilities, be- 
long to him. In addition, the S may 
become identified, to differing degrees, 
with a variety of models. The motives 
and reinforcements that are involved in 
the acquisition and maintenance of this 
cognitive response are elaborated in the 
following assumptions, 


Assumption 1 


Initially the S perceives that the M 
possesses or commands goals and satis- 
factions that the S desires. ‘This per- 
ception leads to a wish to possess these 
desired goal states. 


Assumption 2 


The wish to command the goal state 
of the M leads to the desire to posse’ 
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the characteristics of the M because S 
believes that if he were similar to the 
M he would command the desired goals. 
That is, the S assumes that the more 
similarity there is between the S and M 
the more likely S is to possess or com- 
mand the desired goal states of the M. 

To illustrate, let the S be a child and 
the M a mother, although S and M 
could be an adolescent boy and the 
leader of a group, or a girl and her 
older sister. The child perceives that 
the mother can feed the child, restrict 
the child, obtain articles out of the 
child’s reach, punish the child, etc. 
Thus, to the S, the M appears to com- 
mand desired skills and goal states. The 
discrepancy between the child’s percep- 
tion of his inability to obtain these de- 
sired goals and his perception of the 
more adequate adult elicits the wish to 
Possess or control those goals which he 
perceives that M commands. The per- 
ceptions of the child are subject to dis- 
tortion, and the child may exaggerate 
the degree to which M commands de- 
sired goals. It was assumed (Assump- 
tion 2) that the wish to command these 
goal states led to the expectation that if 
S possessed M’s characteristics he would 
also command these desired goals. There 
often is direct reinforcement of the be- 
lief that to “be similar to” a model is 
equivalent to possessing his positive at- 
tributes. Often, the social environment 
tells the child directly that he is similar 
to a parent in certain characteristics, 
and this communication may be con- 
tiguous in time with statements related 
to some of the model’s desired goal 
states. For example, parents and rela- 
tives may tell the child “You have your 
father’s eyes,” and often add, “You'll 
grow up to be big and strong just like 
Daddy.” It is suggested that these 
statements which associate similarities 
in external attributes with command of 
desired goal states have an important 
effect on the child’s learning about him- 


self, and lead the child to the expecta- 
tion that to be similar to the model is 
equivalent to possessing his positive and 
desirable attributes. 


Assumption 3 


The identification response (i.e., “some 
of the characteristics of the model are 
mine”) is reinforced each time S per- 
ceives or is told that he is similar to the 
M. One type of reinforcement for the 
identification response occurs when an 
S is told directly that he and the M are 
similar in temperament or appearance. 
It is suggested that a second type of 
reinforcement for this cognitive response 
is S’s own perception of similarity to the 
M. Once again, consider the case of the 
small child and his parent. Although 
the child may perceive marked differ- 
ences in size, strength, and skills be- 
tween himself and the M, he may per- 
ceive a similarity in affective states, 
such as joy, anger, or sadness. The 
importance of the perception of simi- 
larities in affective states between the S 
and M is stressed because a major mo- 
tive for identification is a desire to ex- 
perience positive affective states of the 
model. Thus, perception of similarity 
in affect is assumed to have saliency as 
a reinforcement. If the parent becomes 
angry, sad, or happy and communicates 
these affects to the child, the child has 
the opportunity to perceive that he and 
the M experience similar feelings. This 
perception reinforces the belief that 
there is similarity between the S and 
M. In addition to similarity in affec- 
tive states, perception of similarities in 
external characteristics will reinforce the 
identification response. With specific 
reference to the child-parent relation, it 
is assumed that perception of similari- 
ties in sexual anatomy, dress, amount 
and distribution of hair, and other ex- 
ternal attributes are potential rein- 
forcements of the identification. Thus, 
while the identification response is be- 
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ing learned, the major reinforcements 
for the response are perceptions of simi- 
larity between the S and M.* Freud 
suggested that perceptions of similar- 
ity strengthen an identification, for he 
wrote, 


Identification . . . may arise with every new 
perception of a common quality shared with 
some other person who is not an object of 
the sexual instinct. The more important this 
common quality is, the more successful may 
this partial identification become, and it may 
thus represent the beginning of a new tie (5, 
p. 65). 


Assumption 4 


In order for the identification belief 

to be maintained, the S must not only 

perceive similarity between the S and M 

but he also must experience some of the 

desired, affective goal states of the M. 

Thus, if the M were successful or happy 

and S believed that M was experiencing 

positive affect, the S would also feel 
positive affect appropriate to the suc- 
cess, and this experience would reinforce 
his identification. The S also may ex- 
perience affect appropriate to events oc- 
curring to M as a result of the expecta- 
tion that the social environment will re- 
spond to him the same way it responds 
to the M. That is, when the S has 
developed some degree of identification 
with the M he may anticipate that when 
the social environment praises or re- 
wards the M, it will behave similarly to 
him. If, on the other hand, the M were 
sad or criticized, S might experience 


#Tt is suggested that the concept of identifi- 
cation has not yielded to a behavioral analysis 
because the notion of social reinforcement has 
been viewed as a specific action directed at an 
individual by a reinforcing agent. There has 
been a tendency to overlook the possibility 
that a perception, fantasy, or thought may be 
a potential reinforcement of a response, A re- 
cent experimental finding by Estes and Johns 
(1) supports the hypothesis that a person’s 
perception of a situation, even though objec- 
tively inaccurate, can reinforce his subsequent 
behavior. 
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negative affect because of the identifi- 
cation belief that he and the M were 
similar and the expectation that the en- 
vironment might react to him as it did 
to M. However, if no vicarious com- 
mand of desired goals or positive affect 
were experienced as a result of the 
identification, then the response should 
extinguish just as any other habit does 
in the absence of positive reinforce- 
ment.” That is, some degree of identifi- 
cation should be maintained as long as 
S perceives that the M commands de- 
sired goals. When the S no longer per- 
ceives the M in this fashion, then both 
the motivation for the identification and 


the intensity of the positive reinforce- 
ment should decrease, 


The Acquisition of Behavior Similar to 
a Model: The Motives for Imita- 


tion, Identification, and Prohibition 
Learning 


Since perceptions of similarity þe- 
tween the S and M reinforce the identi- 
fication response, the S may imitate the 
M during the acquisition phase of an 
identification in order to increase the 
degree of similarity. It is acknowl- 
edged that the social environment re- 
wards imitative behaviors with affection 
and praise, and these direct, social rein- 
forcements may strengthen the tendency 
to imitate adults independently of any 
identification motives, However, it is 
suggested, along with Sears et al. (23), 
that direct, social reinforcement of imi- 
tative behavior cannot account for all of 
the imitative responses that the S initi- 

5 This view of identification suggests a meas- 
urement operation which differs from the usual 
Practice of assessing similarities in behavior 
between an S and an M, One measure of de- 
Sree of identification would be the degree to 
which an S's affective state or behavior was 
influenced as a result of events that occurred 
fo an M: That is, praise or criticism of a0 
M in S's presence should lead to correspond- 


ing changes in the affective state of an S who 
was identified with the M. 
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ates. A four-year-old child may simu- 
late adult behaviors when the child is 
alone or in situations where the parents 
discourage or punish the imitative re- 
sponse. However, despite the punish- 
ment or absence of social reward for 
some imitative behaviors, the behavior 
continues to be practiced. Sears et al. 
call this behavior “role practice” and 
assume that it is motivated by the 
“desire to reproduce pleasant experi- 
ences” (23, p. 370). Consider the 
three-year-old girl who plays the role 
of mother alone in her room. It is hy- 
pothesized that a potential reinforce- 
ment for this behavior is the creation, 
in fantasy, of perceptual similarity be- 
tween the behaviors of the S and M. 
This perception strengthens S’s identifi- 
cation with the M and allows S to share 
vicariously some of the positive goal 
states which M commands. 

A somewhat different phenomenon is 
the behavior called “identification with 
the aggressor” by A. Freud or “defen- 
sive identification” by Mowrer. Anna 
Freud describes a girl who was afraid 
of ghosts and suddenly began to make 
peculiar gestures as she ran in the dark. 
She told her brother, “there is no need 
to be afraid, you just have to pretend 
that you're the ghost who might meet 
you” (2, p. 119). The present the- 
ory assumes that the child desired the 
threatening power of the feared object 
and this motive elicited the imitative 
behavior. The fantasied perception of 
similarity to the feared model gave S a 
vicarious feeling of power and reduced 
her anxiety over attack. It is suggested 
that “identification with the aggressor” 
does not differ from other identifica- 
tion responses with respect to the basic 
mechanism of acquisition but does in- 
volve a specific motive and goal state. 
Identification with the aggressor in- 
volves a specific relationship between 
the S and M in which S fears the M. 
Thus, S desires the aggressive power or 


threat value of the M in order to reduce 
his own anxiety over anticipated attack. 
It may be misleading to classify “identi- 
fication with the aggressor” as qualita- 
tively different from other identificatory 
behavior merely because the motive and 
goal differ from those involved in other 
identifications. 

A third motive which can lead to be- 
havioral similarity between an S and M 
is anxiety over anticipated loss of love 
or nurturance. It is suggested that 
many social prohibitions which the M 
practices are learned by the S in situa- 
tions in which this anxiety motivate the 
acquisition and maintenance of the re- 
sponse. The reinforcement for the 
learned prohibition is continued accept- 
ance and a consequent reduction in anx- 
iety over rejection. The research of 
Sears et al. (23) suggests a relation- 
ship between “high conscience” in a 
child and a pattern in which the mother 
is nurturant and uses withdrawal of 
love as a disciplinary technique. In 
summary, any one response which is 
imitative of a model may be mediated 
by three different motive-reinforcement 
sequences, and in many instances all 
three may be involved in producing be- 
havioral similarity between an S and 
M.: Thus, “eating neatly,” “getting 
good grades,” or “being nonaggressive” 
could be motivated by the desire for 
praise as in imitation learning, by anx- 
iety over loss of love as in prohibition 
learning, or by the desire to create per- 
ceptual similarity between the S and M 
as in identification. Thus, mere simi- 
larity in overt behavior between an S 
and M may not be the most sensitive 
measure of degree of identification. 

At a more speculative level, it is sug- 


6In a manuscript being prepared for publi- 
cation, H. Kelman suggests that the response 
of conformity to the attitudes of another per- 
son can be mediated by three different mo- 
tives. His analysis of conformity parallels the 
present discussion of imitative behavior. 
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gested that the behaviors which have 
been called “self actualizing” (6) could 
be motivated and reinforced by a de- 
sire for perceptual similarity to an M 
and be an indication of early identifica- 
tion tendencies. Even the most ortho- 
dox supporters of the importance of 
simple imitation learning find it diffi- 
cult to explain the child’s initial imi- 
tations of a model. Once the child has 
begun to imitate a model it is likely that 
praise and recognition from adults could 
maintain this behavior. However, why 
does the child suddenly want to dress 
himself, sit on the toilet alone, or put 
on Daddy’s shoes? It is difficult to ac- 
count for the initial display of this imi- 
tative behavior, and the term “self actu- 
alization” implies that the child has 
some biological drive to use his poten- 
tialities. This hypothesis seems no more 
parsimonious than the suggestion that 
the initiation of these “self actualizing” 
behaviors is motivated by S’s desire 
to create perceptual similarity between 
himself and a model. 


Two Goals Motivating Identification: 
Mastery and Love 


It has been assumed that S’s desire to 
command certain goal states motivates 
his identification with a model. It is 
suggested, for the child especially, that 
two important goal states that the S de- 
sires to command are (a) a feeling of 
power or mastery over the environment 
and (b) love and affection. Attainment 
of these goals should lead to diminution 
in anxiety over helplessness or loneli- 
ness. The young child perceives that he 
is not able to gratify all of his needs 
while the parental model is perceived as 
more capable of dealing with the envi- 
ronment. This discrepancy between the 
S’s perception of his own relative help- 
lessness and the power that he perceives 

M to possess motivates the wish to have 
M’s power and the search for percep- 
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tions of similarity between himself and 
the M. 

Unfortunately, there are no empirical 
studies which directly test these hy- 
potheses because most of the research 
on identification has used similarities in 
behavior between an S and M as the 
measure of identification. However, 
there are some results which are at 
least consistent with the view that the 
child identifies with the more powerful 
parent and the one who is perceived to 
command important sources of gratifi- 
cation. Payne and Mussen (19) re- 
ported that adolescent boys who per- 
ceived the father as rewarding on pro- 
jective tests were more highly identified 
with the father (based on similar an- 
swers to a personality inventory) than 
boys who pictured their fathers as non- 
rewarding. In addition, boys with domi- 
nant and “masculine” mothers tended to 
be poorly identifed with the father. 
P. S. Sears (22) reported a finding that 
is more difficult to explain without use 
of the concept of identification. She 
found, in a doll-play situation, that 
kindergarten girls used the mother doll 
as agent significantly more often than 
the father doll, while boys used both 
mother and father dolls with more nearly 
equal frequency. Since the mother is 
initially the major controller of gratifi- 
cations for both sexes, one might ex- 
pect an initial identification with her 
for both boys and girls. P. S. Sears 
(22) also reports that the kindergarten 
boys who used the mother doll most 
often had mothers who were (a) more 
nurturant than the father, (b) more 
critical of the father, and (c) more re- 
strictive of the child’s mobility outside 
the home. This result is consistent with 
the hypothesis that the child is pre- 
disposed to identify with the parental 
model who is perceived as controlling 
Important goal states. 

A study of Maccoby and Wilson (15) 
furnishes more direct support for the 
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present hypotheses. The authors showed 
movies to seventh grade boys and girls 
and then determined the protagonist 
with whom the child identified. The 
most significant result was that a “boy’s 
choice of screen character (the one with 
whom he was presumed to identify) is 
more closely related to the social class 
level to which he aspires than to the 
level his family currently occupies” (15, 
p. 79). This result suggests that the 
child identified with models who com- 
manded desired goals. 

A second goal state which may moti- 
vate identification is the desire for nur- 
turance and affection. In addition to 
Freud’s classical hypothesis that the 
child identified with the same-sex par- 
ent in order to receive vicariously the 
affection of the opposite-sex parent, 
there are situations in which nonpar- 
ental models command sources of affec- 
tion. The relation between siblings is 
such a situation, and the younger child 
may identify with an older sibling if the 
former perceives that the latter com- 
mands parental affection. The research 
of Helen Koch (12) indirectly sup- 
ports this hypothesis. She reported that 
school-age boys with older sisters tended 
to develop more feminine attributes than 
boys with older brothers. On the other 
hand, girls with older brothers tend to 
be more masculine than girls with older 
sisters. In the experiment of Maccoby 
and Wilson, described earlier, the au- 
thors reported that girls were more 
likely than boys to recall movie con- 
tent involving boy-girl interaction while 
boys were superior on recall of aggres- 
sive acts by the hero. If one assumes 
that the need for affection is stronger 
for girls than for boys, and that the re- 
called content is influenced by the model 
chosen for identification, then these re- 
sults suggest that the specific goal states 
desired by the S determine the models 
chosen for identification. 


Factors Influencing the Strength of 
Identification 


The strength of the identification 
habit, following a basic behavioral law, 
should be a function of the strength of 
the motive and the quality and fre- 
quency of the reinforcement (8). It 
would be predicted, therefore, that the 
most intense identification would occur 
when the S had strong needs for love 
and power, felt incapable of gratifying 
these motives through his own skills, 
and perceived similarity between him- 
self and an M who commanded these 
goals. Utilizing this hypothesis, two 
generalized predictions can be made 
concerning the strength of identification 
for different ages and models. 

1. The strength of identification tend- 
encies should decrease with age because, 
in general, the individual’s ability to 
gratify his needs for mastery and love 
through his own behavior, rather than 
through a vicarious mechanism, should 
increase with development. Thus, the 
identifications of a young child should 
be more intense than the identifications 
of older individuals. 

2. An identification with an M with 
whom S was in direct contact should be 
stronger than with an M with whom S 
was not in contact, assuming that the 
motivation for identification was con- 
stant and the models were perceived 
as equally potent. This statement is 
based on the assumption that the rein- 
forcements of perceived similarity are 
stronger when S perceives the affects and 
attributes of the M directly as opposed 
to instances in which he is merely told 
that he is similar to the M. Thus, 
degree of identification with a father 
with whom S was in contact should be 
greater than with an imagined fantasy 
father whom S had never seen. Only 
very indirect evidence is available to 
support this prediction. However, re- 
ports by P. S. Sears (21) and Sears 


304 JEROME 
et al. (24) suggest that absence of the 
father from the home tends to decrease 
the degree of “masculine” doll play in 
preschool boys while this experience has 
little effect on the doll play of girls, 
The results are open to alternative in- 
terpretations but are not inconsistent 
with the present hypothesis. 


SUMMARY 


This paper has attempted to analyze 
the concept of identification and place 
the concept within a learning-theory 
framework. Identification was defined 
as an acquired, cognitive response. The 
content of this response was that some 
of the characteristics of a model be- 
longed to the individual and the indi- 
vidual behaved as if some of the charac- 
teristics and affective states of the model 
belonged to him. Identification was not 
viewed as an all-or-none process. An 
identification can vary in strength and 
the individual can identify, to differ- 
ing degrees, with a variety of models. 
The motive for the acquisition and 
maintenance of the identification re- 
sponse was a desire for the positive 
goal states commanded by the model, 
and mastery of the environment and 
love-nurturance were suggested as two 
important goals. The reinforcement for 
the acquisition of the identification was 
perceived similarity in attributes be- 
tween the person and the model. Thus, 
the person may strive to imitate aspects 
of the model’s behavior in order to cre- 
ate perceptual similarity between him- 
self and the model. Once the identifi- 
cation was established, the individual 
behaved as if the goal states of the 
model belonged to him and the positive 
affect derived from this vicarious shar- 
ing of desired goal states helped to main- 
tain the identification. 

It was suggested that the usual em- 
phasis on similarities in overt behavior 
between an individual and a model is 
not the best measure of identification, 
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since the motives and reinforcements 
involved in imitation and prohibition 
learning could also explain similarities 
in behavior between two people. A dif- 
ferentiation of imitative behavior based 
on imitation learning, prohibition learn- 
ing, and identification was attempted. 
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INTRALIST GENERALIZATION IN PAIRED-ASSOCIATE 
LEARNING * 


BENNET B. MURDOCK, JR. 


University of 


It seems to be generally accepted that, 
during the course of paired-associate 
learning, generalization tendencies in- 
crease to a maximum and then decrease. 
This hypothesis is endorsed by Osgood 
(16, pp. 537-539), and stated without 
comment by Bugelski (3, p. 410), Hov- 
land (12, p. 666), and Underwood (17, 
p. 408). This increase and decrease in 
generalization tendencies during verbal 
learning was first suggested by Gibson 
(8), who made it an integral part of 
her theoretical formulation. Her Pos- 
tulate 4 states, “Generalization will in- 
crease to a maximum or peak during the 
early stages of practice with a list, after 
which it will decrease as practice is con- 
tinued” (8, p. 206). In this article we 
would like to suggest an alternate but 
opposing hypothesis, that generalization 
is greatest at the start of paired-associ- 
ate learning and only decreases with 
practice. 

In a paired-associate list 2 pairs long, 
four different types of responses can be 
distinguished: correct responses (Sy—R,, 
S;—R,,...S;—Rn); incorrect responses 
(Sa—Rv, Si—Ro,..-Sa—Rnj S:—Ra, 
Sv—R,,...S;—Rn, etc.); intrusions 
(any responses other than R,, Rj,... 
Rn); and omissions (no response). 
Generalization is “. . . the tendency for 
a response R, learned to S, to occur 
when S, (with which it has not been 
previously associated) is presented” (8, 


p. 204). According to this definition, 
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2590, from the National Science Foundation. 
The author would like to thank Spaulding 
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an incorrect response would be a gen- 
eralized response, as both are essentially 
the right response to the wrong stimulus. 
An intrusion would not be a generalized 
response to any stimulus in the list be- 
Cause it is not a correct response to any 
stimulus in the list. And obviously cor- 
rect responses and omissions are not gen- 
eralized responses. Therefore, it would 
seem (and this is the position taken 
here) that in paired-associate learning 
incorrect responses and generalized re- 
sponses are one and the same.? 

On a common-sense level it seems un- 
reasonable to consider errors early in 
learning as generalized responses. If the 
correct response has not been learned 
(ie., if S has not yet learned S,—R,), 
how could an incorrect response indi- 
cate generalization? Also, the definition 
of generalization given above states that 
the response R, must be learned to the 
stimulus S, before S,—R, could be con- 
sidered a generalized response. How- 
ever, if the response Ra does in fact 
occur to stimulus S,, this would almost 
of itself indicate that some learning had 
already occurred; the probability of the 
occurrence of this response (or for that 
matter any other response on the list) 
by chance is so slight as to be negligible. 
Therefore, it would seem quite unlikely 
that an incorrect response could occur 
before some learning had taken place. 

Equating incorrect and generalized re- 
sponses is desirable from the point of 


2 An incorrect Tesponse could also indicate 
Tesponse generalization. However, in this ar- 
ticle it is assumed that the response terms are 
sufficiently dissimilar and/or predifferentiated 
so that no response generalization occurs; al 
generalization will therefore be considere 
stimulus generalization, ` 
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view of simplicity. The definition of a 
generalized response in paired-associate 
learning becomes clear cut and unam- 
biguous when the two are considered 
synonymous. There is no need to set 
up some arbitrary point in the learning 
before which incorrect responses would 
not be generalized responses but after 
which incorrect responses would be gen- 
eralized responses. And, as will be 
pointed out in the final section of this 
paper, this position results in a predic- 
tion about transfer of training which is 
apparently substantiated by the avail- 
able data. Therefore, it is felt that in- 
correct responses at any stage in paired- 
associate learning should be considered 
generalized responses. 

In the standard paired-associate learn- 
ing task S is not informed as to what 
the responses are before he actually 
starts to learn the list. Therefore, he 
must learn that the responses are Ra, 
R,,...R,. This may involve learning 
to make responses that have never previ- 
ously been made, and it will certainly in- 
volve learning to restrict the range of 
responses to Rg, Ry,-.-Rn. Also, S 
must learn the pairings. That is, he 
must learn Se—Ra, S;,—Ro,- - -Su—Rn- 
Furthermore, within limits he is free to 
respond or not as he wishes. There- 
fore, on the first trial of a paired-asso- 
ciate learning task which uses the an- 
ticipation method, the probability of an 
incorrect or generalized response is very 
slight. If S learns to a criterion of so 
many perfect trials, the probability of 
an incorrect or generalized response at 
the end of learning is also very slight. 
Assuming that some generalization 0C- 
curs, there obviously must be first an 
increase, then a decrease in the fre- 
quency of incorrect or generalized re- 
sponses. This does not mean, however, 
that there is an increase and then a de- 
crease in generalization, for under these 
conditions the frequency of incorrect oT 


generalized responses is not an appro- 
priate measure of generalization. 

The frequency of incorrect responses 
is an inappropriate measure of generali- 
zation because omissions can occur, and 
they introduce a systematic bias. Omis- 
sions are most frequent at the start of 
learning, yet this is exactly the time 
when the frequency of generalized re- 
sponses is low. If omissions were pro- 
hibited and S forced to respond to each 
stimulus, almost certainly most of the 
responses (excluding for the moment in- 
trusions) would be incorrect or gener- 
alized responses, not correct responses. 
Therefore, contrary to what Gibson as- 
sumed (9, p. 194), the frequency of 
overt generalized responses is not a satis- 
factory measure of generalization tend- 
encies when omissions are possible, for 
omissions result in a serious underesti- 
mation of the amount of generalization 
that would occur were S forced to re- 
spond. 

Intrusions introduce a further compli- 
cation; they too, if allowed to occur, 
would be most frequent early in learn- 
ing. In the standard paired-associate 
procedure S must learn both the re- 
sponses and the pairings, and intrusions 
indicate that S has not yet learned the 
responses perfectly. But, as Hovland 
and Kurtz (13) have pointed out, this 
is a separate problem from the learning 
of the pairings (or, in the case they 
discuss, the order in a serial list). If 
the chief purpose is to study generali- 
zation, the responses are merely a means 
to an end; one draws conclusions about 
confusion among the stimuli from the 
responses they evoke. Therefore, learn- 
ing the responses merely introduces a 
complication that should be avoided. 

It is felt, then, that to study generali- 
zation tendencies during learning both 
omissions and intrusions should be 
ruled out. This would not be particu- 
larly difficult to do. To exclude omis- 
sions, forced anticipation could be used; 
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S would be instructed always to re- 
spond during the “stimulus-only” pe- 
riod, even if the response were a pure 
guess. To exclude intrusions, § would 
be told the responses (though naturally 
not the pairings) in the instructions. If 
the responses were unfamiliar, a famil- 
larization procedure such as Hovland 
and Kurtz (13) used could precede the 
learning. Under these conditions the 
frequency of incorrect or generalized re- 
Sponses would be an adequate measure 
of generalization tendencies, since omis- 
sions and intrusions had been ruled out. 
Under these conditions only two types 
of responses would be possible, correct 
responses and incorrect or generalized 
responses. Assuming that S followed 
instructions and assuming that learning 
occurred, the frequency of correct re- 
sponses would (disregarding inversions 
and plateaus) increase regularly with 
Practice. Concomitantly, the frequency 
of incorrect or generalized responses 
would decrease regularly with practice. 
And, since it is felt that these are the 
appropriate conditions under which to 
measure generalization, it would seem 
that generalization tendencies have to be 
at a maximum at the start of learning 
and decrease with Practice. There is, 
therefore, only a decrease, and no in- 
Crease, in generalization during paired- 
associate learning. 
The foregoing has been based on the 
assumption that the similarity among 
stimuli is the same. That is, no two 
stimuli are more similar than any other 
two. This is probably impossible to 
achieve in practice, and is often deliber- 
ately violated in order to get groups of 
stimuli within the total list with high 
intragroup similarity. If S, and S, are 
very similar to one another, the prob- 
ability of S,—R, may be greater after 
a certain amount of practice than it was 
originally. However, the fact that the 
probability of this one specific gener- 
alized response has increased does not 


mean that the probability of all gener- 
alized responses to S, has increased. At 
the start of learning (assuming no omis- 
sions or intrusions), the probability that 
Si—R, will occur by chance would be 
1/n, the probability that S,—R, will 
occur by chance would also be 1/n, and 
the sum of the probabilities that other 
incorrect or generalized responses will 
occur by chance would be (n— 2)/n. 
Even though the probability of S,—Ra 
may have increased with practice (and 
it may increase appreciably), provided 
that the probability of the correct re- 
sponse S;—R, does not drop below 
chance (i.e., 1/n), then this increase 
must be accompanied by a correspond- 
ing decrease in the probability of other 
incorrect or generalized responses (i.e. 
(n —2)/n). Therefore, under these con- 
ditions there can be no net increase in 
generalization. 

Suppose, however, that the prob- 
ability of the correct response S,—Ry, 
did actually drop below chance (1/n). 
This could conceivably occur if Sa—Ra 
was learned (ie., if the probability 
of Sa—Ra increased from 1/n to some 
value X, where 1/fn<X< 1.00), if 
So—R, was not learned, and if Są 
and S, were sufficiently similar (in 
the extreme case indistinguishable) to 
produce generalization. Assume that 
there is a decrease in the Probability of 
S;—R,; the maximum decrease possible 
would be from 1/n to 0, or 1/n. At the 
same time there must, of necessity, be 
an increase in the probabil 


ity of a 
Correct response to Sa. This increase 
would be from 1/7 to X, or X= ijn. 


The only way in which th 
a net increase in generali 
encies would be for the d 
probability S,—R, to be greater than 
the increase in the probability Sg—Ra} 
ie., for 1/n to be greater than X — 1/n. 
For this to occur, X would have to 
be smaller than 2/n, or the increase in 
the probability of the correct response 
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zation tend- 
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Sa—R, would have to be less than 1/n. 
With a paired-associate list of any ap- 
preciable length this maximum allow- 
able increase in probability would be so 
slight as to make it doubtful that any 
generalization could occur after so little 
learning. So the conditions necessary 
for a net increase in generalization would 
seem to rule out the possibility that any 
generalization will occur. 

It is possible, of course, for more than 
two stimuli to be similar. If two or 
more stimuli are similar to Sa, the maxi- 
mum allowable increase in the probabil- 
ity of S;—R, becomes »'/, where 7’ is 
the number of stimuli similar to Se. The 
net increase in generalization then be- 
comes n'/n — (X — 1/n), or (n' +1)/ 
n —X. This is the most general expres- 
sion, and, as before, the smaller X is, 
the more generalization tendencies could 
increase but the less likely it is that any 
generalization could occur, However, as 
X approaches 1.00, a net increase in 
generalization becomes impossible no 
matter how many stimuli are similar to 
Sa. So, to summarize, because the in- 
crease in the probability of one specific 
generalized response must be accom- 
panied either by a decrease in the prob- 
ability of other generalized responses to 
the same stimulus or by the increase in 
the probability of a correct response to 
a similar stimulus, it seems highly un- 
likely that, even with unequal intralist 
stimulus similarity, there ever could be 
a measurable increase in generalization 
tendencies with practice. 

The hypothesis presented here sug- 
gests that there is only a decrease in 
generalization tendencies, yet studies of 
conditioning have shown an initial in- 
crease. One explanation for this dif- 
ference is that in paired-associate learn- 
ing all stimuli are presented to S from 
the first trial on. Learning requires the 
formation of discriminations, and gen- 
eralized responses must be extinguished 
if a criterion of learning is to be reached. 
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Fic. 1. Curve for number of correct and 
generalized responses during learning (copied 
from Gibson’s Fig. 3 [9, p. 195]). 


On the other hand, in studies of clas- 
sical conditioning the stimuli used in 
the test for generalization are custom- 
arily not presented until the conditioned 
response has become established, and 
there is no opportunity for prior extinc- 
tion of generalized responses. 


RELATED STUDIES 


At least two studies, one by Gibson 
(9) and one by Gagné (4), have re- 
ported an increase and then a decrease 
in generalized responses in a standard 
paired-associate learning task. The pro- 
cedures of the two experiments were 
very similar, and apparently omissions 
and intrusions were possible in both. 
Therefore, the results of these experi- 
ments do not prove that generalization 
increased before decreasing. However, 
even allowing for the procedure used, 
the evidence they present for an initial 
increase in generalization is less impres- 
sive than first appears. 

In the Gibson study, Vincent curves 
were plotted for correct and generalized 
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Fic, 2. Fig. 1 redrawn with curve for num- 
ber of generalized responses to scale. 


responses. In her Fig. 3, redrawn here 
as Fig. 1, the curve for generalized re- 
sponses appears to do exactly what it is 
supposed to do, increase to a maximum 
early in learning and then decrease, 
However, this is misleading because the 
two curves are not drawn to the same 
scale. The scale of one is ten times that 
of the other. When drawn to the same 
scale (see Fig. 2) the increase in gen- 
eralized responses is relatively trivial 
compared to the increase in the fre- 
quency of correct responses that is oc- 
curring at the same time. 
The reason for emphasizing changes 
in generalized responses in relation to 
changes in correct responses is that a 
prior study by Hovland ( 11) dealt with 
the relative strength of a generalized re- 
sponse throughout learning. This was a 
study of generalization as a function of 
the number of reinforcements in classi- 
cal conditioning, and was evidently in- 
fluential in suggesting to Gibson the 
analogous hypothesis for verbal learn- 
ing. Hovland measured the strength of 
a conditioned GSR after 8, 16, 24, and 
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48 paired presentations. Actually, the 
absolute strength of the generalized re- 
sponse increased with the number of re- 
inforcements, but as it increased less at 
the higher levels than the strength of the 
conditioned response, there was even- 
tually a relative decrease in generaliza- 
tion. 

In the Gagné study (4) the errors also 
increased to a maximum and then de- 
creased. Table 1 shows the number of 
correct responses, the number of general- 
ized responses, and the ratio of correct to 
generalized responses for Group 1, the 
group that demonstrated the greatest ef- 
fect. As the ratio shows, there was a 
slight initial increase in generalized re- 
sponses. However, the increase was only 
from 3.11 to 3.27 while the decrease 
was far larger, from 3.27 to 0.22. So 
here, as in the Gibson study cited above, 
on a relative basis the increase was not 
impressive. And, it is felt, had omis- 
sions and intrusions been ruled out, 
there would not have been any increase 
at all in generalized responses. 


TABLE 1 


TRIAL-BY-TRIAL RECORD oF NUMBER OF 
GENERALIZED RESPONSES, CORRECT 
RESPONSES, AND RATIO OF 
GENERALIZED TO CORRECT 


REsponsEs* 

Triat | Gencraliced einen Ratio 
1 2.27 0.73 3.11 

2 3.87 1,20 3.22 

3 4.80 1.47 3.27 

4 4.93 2.33 2.12 

5 4.40 3.80 1.16 

6 4.73 3.53 1.34 

/ 4.73 4.20 1.13 

8 4.13 4.87 0.85 

9 3.67 5.40 0.68 
10 3.80 6.00 0.63 
11 2.67 6.20 0.43 
12 2.47 7.20 0.34 
13 1.73 7.67 0.23 
14 1.80 8.20 0.22 

a For Gr 


oup 1 from Table 1 and Table 2 of Gagné 
(4, pp. 65 and 67). m Table 1 and Table 
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In addition to the experiments of Gib- 

son and Gagné, there is an experiment 
by Gagné and Foster (6) which also ap- 
pears to confirm the Gibson hypothesis. 
This was a choice reaction-time experi- 
ment in which Ss were given 0, 10, 30, 
or 50 trials on two of the four com- 
ponents of the final task prior to 60 
trials on the complete task itself. Gagné 
and Foster used the number of errors 
as a measure of generalization, and re- 
ported two main findings which appear 
to support Gibson. First, the total num- 
ber of errors on Task 2 increased and 
then decreased as a function of number 
of trials on Task 1. That is, the 10- 
trial group made more errors, but the 
30-trial and 50-trial group made fewer 
and many fewer errors than the 0-trial 
(control) group, respectively. This in- 
crease and decrease in the total number 
of errors is shown clearly in their Fig. 4 
(6, p. 60). The second finding in sup- 
port of Gibson is that the point of maxi- 
mum errors (i.e., maximum generaliza- 
tion) appears to shift to the left as 
practice on Task 1 increases. As they 
report, 
... the point of maximum generalization 
[abbreviated PMG] seems to be passed in the 
zero-trial group at about the 40th trial; in 
the 10-trial group at about the 30th trial; in 
the 30-trial group at about the 15th trial. 
The group having 50 trials of preliminary 
training does not exhibit any marked drop m 
error score (6, pp. 60-61). 


Not only do generalization tendencies 
increase, according to Gibson, early in 
learning but also—and this is what 
Gagné and Foster seem to overlook— 
after the PMG they should decrease. 
Assuming comparable groups, assuming 
that training on Task 1 makes the PMG 
occur earlier on Task 2, and assuming 
that the more training on Task 1 the 
earlier the PMG, then according to Gib- 
son’s theory a group with more training 
on Task 1 should make more errors On 
Task 2 before its PMG than the other 


group. However, after the PMG of the 
second group, the first group should 
make fewer errors. For instance, the 0- 
trial group reached its PMG at 40 trials 
and the 10-irial group reached its PMG 
at 30 trials. Before 30 trials the 10- 
trial group should make more errors 
than the 0-trial group. However, after 
40 trials this should reverse; the 0-trial 
group should make more errors than the 
10-trial group. (The relative frequency 
of errors between the two PMGs cannot 
be predicted without knowing the exact 
shape of the curves.) 

If this reasoning is correct, then it 
is inappropriate to compare groups in 
terms of total number of errors made in 
Task 2. Instead, it should be predicted 
that the group with the earlier PMG 
should make more errors early but 
fewer errors late in Task-2 learning. 
Since Gagné and Foster report the num- 
ber of errors for each group in blocks of 
ten trials each, and since they estimate 
the PMG of three groups, it is possible 
to use their data to test Gibson’s theory 
in this way. Each of the three groups 
was compared with the other two, and 
the direction of the predicted difference 
in errors for each comparison is shown 
in Table 2. In each case the prediction 
was based solely on the relative position 
of the two PMGs, and no predictions 
were made for the trials falling between 


` the two PMGs. The actual results were 


determined from Fig. 3 (6, p. 59) in 
Gagné and Foster’s article. Each pre- 
diction was considered either correct or 
incorrect, and this is shown in Table 2 
by plus and minus signs. As can be 
seen, the predictions based on Gibson’s 
theory are correct seven times and in- 
correct five times. Thus, the theory 
would not seem to be appreciably bet- 
ter than chance. So here, as in the 
other experiments, the results fail to 
provide clear-cut proof that generaliza- 
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TABLE 2 


PREDICTIONS As TO WHICH or Two Groups SHOULD SHow More 
Errors IN Eacu Brock or Trrars@ 


Comparisons 


Blocks of Ten Trials 


1-10 11-20 21-30 31-40 41-50 51-60 
O-trial with 10-trial | 0<10(+) | 0<10(+) 0<10(+) — 0>10(—)| 0>10(—) 
O-trial with 30-trial | 0<30(+) — = — 0>30(—)| 0>30(—) 
10-trial with 30-trial |10<30(—) — — 10>30(+) | 10>30(++) | 10>30(+) 


* A dash indicates that no prediction can be made. 


tion tendencies increase to a maximum 
and then decrease.” 


PREDICTIONS 

The question of whether or not gen- 
eralization tendencies increase early in 
learning is of some importance in that 
the two points of view result in different 
predictions. In a transfer situation with 
an A—B, A—C paradigm, S is learning 
a set of responses to old stimuli. If, as 
Gibson says, a low level of practice on 
A—B results in an increase in generali- 
zation, then it should follow that nega- 
tive transfer would occur; a little prac- 
tice would be worse than none at all. 
On the other hand, if generalization only 
decreased there should be no negative 
transfer under any conditions, only posi- 
tive transfer increasing as the amount 
of practice of the first task increased. 
Gibson discusses this case in her Propo- 


sition XV, “Learning of Pre-differenti-- 


ated Items” (8, p. 222), where she 
points out that positive transfer should 
occur in the learning of well-differenti- 
ated items. 


è Three other experiments (2, 5, 7) in this 
series of studies varied the degree of first-task 
learning so as to permit a test of the Gibson 
hypothesis. In not one of these three experi- 
ments was there any indication that a low 
level of first-task learning resulted in an in- 
crease in errors on the second task, Actually 
the reverse was generally true; in all three 
experiments the more practice on the first task, 
the fewer the errors that usually occurred 
early in the second-task learning, 


Studies in verbal learning using an 
A—B, A—C paradigm cannot be used 
to test this prediction because the inter- 
list response interference may obscure 
any effects due to predifferentiation. 
However, the so-called “stimulus pre- 
differentiation” studies (1, 14) would 
appear to provide an ideal test for these 
predictions. The typical predifferentia- 
tion study uses an A—B, A—C para- 
digm with a relatively high level of 
intralist stimulus similarity, thus maxi- 
mizing the possibilities for reduction of 
generalization tendencies. Low inter- 
list response similarity is also common, 
and this minimizes the possibilities of 
associative inhibition. Thus, in Task 1 
S may have to learn names for complex 
visual stimuli, while in Task 2 S has to 
learn simple motor discriminative re- 
sponses to the same stimuli (15). Ac- 
cording to Gibson, as degree of Task 
1 learning increases, hegative transfer 
should increase to a maximum, then de- 
crease, giving way to positive transfer 
with sufficient practice on Task 1. 

Recently Arnoult has made a survey 
of predifferentiation studies, and con- 
cludes that there js “enough agreement 
in results among various experiments to 
provide generalizable conclusions in two 
broad areas” (1, p, 339). Fortunately, 
one of these two areas is the amount of 
predifferentiation training, and his gen- 
eralization is, “Positive transfer from 
stimulus predifferentiation training may 
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be expected after a minimum of 4 to 8 
pretraining trials and reaches a maxi- 
mum after 8 to 12 pretraining trials” 
(1, p. 341). Further on in the same 
article Arnoult discusses the specific pre- 
diction resulting from Gibson’s hypothe- 
sis, and states (1, p. 344) that there 
is no evidence (“negative results”) for 
negative transfer. Thus, the studies re- 
ported to date give no support to Gib- 
son’s position. On the other hand, they 
are consistent with the viewpoint that 
generalization is greatest at the start of 
learning.* 


SUMMARY 


Gibson has suggested that during the 
course of paired-associate learning gen- 
eralization increases to a maximum and 
then decreases. It is suggested here that 
generalization is maximum at the start 
of paired-associate learning and only 
decreases with practice. Changes in 
generalization cannot be inferred from 
changes in overt errors in the typical 
paired-associate procedure, for omissions 
and intrusions can occur, and these in- 
troduce a systematic bias. When the 
procedure is such that neither omissions 
nor intrusions can occur, the frequency 
of incorrect responses does provide a 
siutable measure of generalization, and 
under these conditions if learning 0C- 
curs generalization can only decrease 
with practice. A 

One prediction that results from the 
position presented here is that with an 
A—B, A—C transfer paradigm and low 
interlist response interference, positive 


4 Goss (10) incorporates the Gibson concept 
of increase and decrease in generalization into 
his theoretical system, and makes the predic- 
tion that high similarity stimuli will actually 
result in less transfer than stimuli of medium 
similarity. Here too the prediction from Gib- 
son’s theory seems to be wrong; in a recent 
study (15) in which similarity was varied, the 
high-similarity group was the only group 
showing significant positive transfer. 


transfer should increase as the degree of 
first-task learning increases. Studies of 
stimulus predifferentiation provide an 
appropriate test, and the results of these 
studies appear to support this prediction. 
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TOWARD A UNITARY THEORY OF PERCEPTION 


KAI von FIEANDT 


University of Helsinki 


Perceptual psychology today inclines 
toward phenomenological terminology. 
It deals chiefly with impressions of ex- 
perienced reality. Recently perception- 
ists have been interested in physio- 
logical correlates of these impressions. 
Because very complicated physiological 
relations underlie even the primary or 
simple perceptual processes, is it not 
natural, then, that psychological termi- 
nology should be tentatively phenome- 
nological? Only consider a few of the 
contents of immediate experience that 
can be easily described in phenomeno- 
logical terms—round, complex, closed, 
threatening, balanced, precise, adaptive 
—which are functionally dependent on 
very complicated underlying structures. 
In the words of Eino Kaila, “Consid- 
ered functionally the Gestalt quality is 
always very complicated: actually it 
can be simply described” (4, 5). Also 
from modern personality psychology one 
can reach the same conclusion. Person- 
ality diagnosticians employ descriptive 
(phenomenological) expressions from 
colloquial speech which often represent 
rich unanalyzable manifolds of func- 
tional relationships. 

The problem of a unified explanation 
of our perceptual world implies a re- 
search program to investigate the con- 
ditions of our impressions of objects. 
Why do we assume a “substance” or a 
“material” as the carrier of perceptual 
thing-quality? The phenomenal object 
gains in reality when the impression is 
multidimensional. The material in ques- 
tion must not only offer itself optically, 
but also preferably tangibly. We adopt 
in this respect the classical reality cri- 
teria of St. Thomas. By means of multi- 


dimensional analysis we can, in my 
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opinion, also achieve scientific progress. 
However phenomenological the termi- 
nology, we can probably discern cer- 
tain systems of relations by means of 
an analysis of our coordinated sensa- 
tions, thus avoiding the diffuse impres- 
sion of “substance.” When proceeding 
to map these systems of configurations 
in terms of quantitative relations, the 
science of psychology can make progress 
towards becoming a natural science. 

In my opinion, the most important 
contribution to this problem was made 
by David Katz in Tke Modes of Ap- 
pearance of Colours and Their Depend- 
ence on Individual Experience, 1911, 
later revised as The World of Colour, 
1930 (7, 8). When he described the 
phenomena of “surface color,” “film 
color,” and “space color,” he could not 
guess how important these ventures 
would later prove to be for psychologi- 
cal theory. When he referred to the 
meaning of the reduction-screen and 
formulated his principle of “field-size,” 
he implied a setting rich in the rela- 
tional theory of explanation, to which 
we today are indebted for the possi- 
bility of intermodal theory. We must 
give Katz the credit he deserves when 
he maintains, in his Gestalt Psychology 
(9), that we previously were concerned 
too exclusively about the psychology of 
the visual sense. Probably there is for 
this preference a perceptual-psychologi- 
cal explanation. There exists in the 
optical realm the possibility of com- 
bining certain cues for “thingness” with 
more elementary color qualities—for ex- 
ample, microstructure of the surfaces, 
form and size relations, general lighting, 
etc. The perceived colors can, in other 
words, be exclusively related to optical 


316 KAI von 
phenomena. The conditions of object- 
perception can consequently be reduced 
by operations not exceeding the optical 
sphere. And vice-versa: one can ob- 
jectify “pure” color sensations without 
leaving the optical modality; one sim- 
ply coordinates them with form impres- 
sions. The leading position of the visual 
sense makes this many-sidedness pos- 
sible. There is no reason why color 
impressions should immediately be re- 
ferred to other modal contents, for opti- 
cal form and space are best adapted to 
“take them over,” 

Similar considerations apply to the 
perceived world of touch. In his Struc- 
ture of the Touch-World (7), Katz de- 
scribes the modes of appearance of tac- 
tual experience. How is this modality 
objectively rendered (objectivized) in 
terms of palpable images? Only men 
with normal visual images (not the 
born-blind) are able to objectivize their 
exclusively tactual impressions. Their 
optically constructed coordinate-system 
is obviously able to supply information 
about simultaneous shape-gestalts in the 
environment. 

Let us consider some examples: 

Surface-touch provides us a thing- 
impression whatever the position and 
the distance of the touched object. 

Space-filling touch will be directly ex- 
perienced in walking in a strong wind. 
A similar experience is also felt when 
one moves one’s extremities quickly 
through a liquid. These cues hardly 
provide a basis for objectivizing, for a 
definitive object cannot be experienced. 

Therefore one could consider this space- 
filling phenomenon as an analogy of film 
color. 

Space-filling surface quality can bring 
with it objectification, especially as it 
concerns itself with a relatively small 
hard object behind the material jn ques- 
tion. The situation cannot be compared 
with the perception of a surface color 
“through” or “behind” colored illumi- 
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nation. The configuration brought out 
by illumination has its counterpart in 
the working of the tactual (kinematic) 
or haptic framework. The movement of 
the hands or of the felt surface yields 
an articulation comparable to the ef- 
fect of illumination in vision. 

The born-blind reach a certain state 
of phenomenal objectivity on the basis 
of their simultaneous haptic and kin- 
esthetic impressions. But, as is well 
known, their tactile-kinesthetic gestalten 
remain only formless and diffused (12). 

The sense of hearing can mediate 
thing-impressions only if the auditory 
stimulation comes combined with optic 
or haptic stimuli simultaneously and is 
localized in the same direction. The 
tendency to identify auditory impres- 
sions with visible things is shown in 
spiritualistic seances when ventriloquist 
mediums create illusions of sounds issu- 
ing from megaphones. In discussing ob- 
jectified noises we find for the first time 
the attempt to deal with circumstances 
that Katz had not discussed in any of 
his works; various stimuli (noise, smell, 
taste, and pain) can be combined to 
yield “thing-characteristics,” but only if 
intermodal qualities are involved. On 
the other hand, the dimensions of re- 
ality, e.g., of “noise-objects,” can be re- 
duced, so that finally only the pure noise 
remains. As is commonly known, such 
“pure sensation” is never found in natu- 
ral surroundings. It can only be ex- 
perienced in laboratories as a result of 
artificial refinement. The objectiva- 
tion of noise depends therefore either 
on optic or tactual stimuli simultaneous 
with the auditory quality. Neverthe- 
less, sounds do have phenomenal qual- 
ity. The tone-colors of different musi- 
cal instruments are evidently not alike. 
The strongest loudness-invariance mani- 
fests itself in those tone qualities that 
are functionally connected with the 
most complex relation-systems, as in the 
case of the human voice (14). 


SASS. SSE SSS EE 


TOWARD A UNITARY THEORY OF PERCEPTION 317 


Katz’s fruitful idea can also be ap- 
Plied to smell, taste, and other types of 
perceptions. I have also distinguished 
phenomenal modes of appearance for 


. these modalities. Henning (3) describes 


several objectivizing and resolving proc- 
esses in the sphere of smell sensation. 
I have postulated floating smell and ob- 
jectified smell as phenomenal modes, and 
correspondingly swelling taste and ob- 
jectified taste impressions. A floating 
smell and a swelling taste can only be- 
come objectified with parallel optic and 
haptic stimuli, respectively. The crux 
of the question is found with warmth, 
cold, and pain perceptions. Warmth 
and cold may remain isolated in float- 
ing or condensed states, or can be ob- 
Jectified with simultaneous optic or hap- 
tic impressions. Pricking pain (but not 
other forms of pain) is objectified (1), 
ìn part, in terms of stimulation pro- 
vided by stabbing instruments (needle, 
knife, etc.) because of the added tactile 
and perhaps also the form component. 

Our phenomenal object-configurations 
are, therefore, for the most part inter- 
modally determined. It is impossible 
for us to imagine a nonintermodal world 
of things. The different modal systems 
do not seem so hermetically sealed from 
each other as was thought in the days 
of J. Miiller. The senses constitute a 
single system. For man, undoubtedly, 
the world of sight is the most impor- 
tant, but surely a purely visual world 
would be an abstraction. Such a thing 
is never found in empirical reality. It 
is, for example, impossible on the basis 
of visual impressions, when the skin is 
anaesthetized, to decide when and at 
what point the surface of the body 
becomes excited above the threshold. 
However, the perception of one’s bodily 
Orientation in a dimly lit room is a 
function of a visual ideational structure. 

But there are also cases where the 
haptic and vestibular controls of the op- 
tical organization are in conflict. Only 


harmony between the input systems in 
question can restore the equilibrium of 
the perceptual field. Evidently these 
optic and haptic components possess 
special integrative functions for the 
“real” gestalts. The reality-impression 
of objects, their material and substan- 
tial character, is probably grounded in 
the repeated anchoring of phenomenal 
objects. These “moorings” arise from 
the coordination of the perceptions with 
basic optic and haptic elements, that is 
to say, from the process of objectivizing. 
“Rigid objects” seem to depend on co- 
operation of the visual and auditory 
senses. It seems evident that no gestalt 
occurs on the basis of taste, smell, vi- 
bration, warmth, cold, or pain impres- 
sions. Acoustic impressions can pass 
over into gestalten, but the time factor 
is then an organizing element. Tempo- 
ral organization yields a dynamic, suc- 
cessive gestalt. In this, the “cortical- 
ized level” of the auditory sense plays a 
decisive role. Smell and taste do not | 
have “melodies” because of the impos- 
sibility of object-gestalten at this level. 
From the foregoing we can conclude 
that organized perception (that is to 
say, at the cortical stage) is really a 
system of relationships. Evidently we 
react to certain relations in which we— 
phenomenally speaking—experience con- 
tents. In this concrete relational envi- 
ronment there exist, to be sure, various 
specific stimuli. They exert their influ- 
ence on our afferent nerves. That is the 
physiological side of the perceptual proc- 
ess. The psychological side of the proc- 
ess implies that the organism reacts as 
a whole to a relational system of stimu- 
lus-effects. These relational systems are 
almost unsurveyable in their complexity, 
e.g., with respect to the functional foun- 
dations of a perceptual object. Until 
recently we have been able to analyze 
quantitatively these complex wholes 
only with respect to some details of 
their appearance. Every stimulus is, to 
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paraphrase Metzger, a stimulus-system, 
besides having its specific function. It 
also merges with the whole stimulus-sys- 
tem in its totality (11). From this fact 
it follows that we get firmer and more 
exact perceptual achievements, the more 
the sporadic stimulus-components con- 
currently connect themselves to a frame- 
work. In: this sense the well known 
facts of phenomenal object-constancy 
(size, form, and color constancy) de- 
pend on a richly organized pattern of 
stimulus-relationships. 
We have already mentioned Katz’s 
statement about “field-size.” The ob- 
ject-color seems the more “retinal,” i.e., 
more nearly pure film color, the smaller 
the field in which it is presented. Katz 
could thus establish that the color con- 
stancy of perceived objects under sev- 
eral colored illuminations increases as 
the field of view either is enlarged or 
seems to be enlarged. Metzger has 
shown convincingly that these findings 
do not depend on the limitation of the 
field of vision. The opportunity to 
freely look around is, of course, impor- 
tant, since better possibilities of sepa- 
rate analyses of the constant chromatic 
components of the illumination are then 
at hand. Free observation with moving 
eyes affords the perception of numerous 
“systematic stimuli.” But the impover- 
ishment of this clue can also be ob- 
tained by kinds of reduction other than 
by cutting down the field of vision. 
Metzger’s beautiful experiments with the 
homogeneous total field show that we 
can give to the field of vision any size 
without obtaining thing-constancy (10). 
The articulation of the stimulus-field is 
decisive. Minimal articulation can also 
be a consequence of too weak an illumi- 
nation. In this way “reduction” of 
color can be brought about, 
When we say that Katz is right, we 
must comprehend generally the validity 
of what he has formulated. Tt is valid 


for more than vision only. The con- 
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cept of “reduction” is applicable to all 
senses for which objective patterning 
exists. The effect of the visual “reduc- 
tion-screen” is really only a special case. 
We meet everywhere in all stimulus 
situations invariant perceptual objects, 
previously encountered and formed. 
And, analogously, we always find a de 
clining degree of invariance when the 
reciprocity of the concomitant stimuli iS 
reduced by any method. We must not 
think that this reduction can be ob- 
tained by identical technical means in 
all the different sense-spheres. The 
multiple stimuli for vision are obtained 
by the scope of the field of view or by 
the illumination of the field of view- 
For haptics, it is through the kinesthetic 
sense, for instance, the finger tips. For 
the invariance experiments in olfactory 
and gustatory perception, a reduction 
can only be obtained by reducing the 
possibilities of objectivation (through 
gradual elimination of vision and touch). 
The meaning of “articulation stages” 
can be very beautifully clarified by ex- 
periments on loudness-constancy with 
changing distance. The human voice is 
the most constant and least variable 
sound, while a buzzer-sound has the 
lowest invariance level (14), 

We now leave this train of thought 
and occupy ourselves with some related 
points of view. To be perceived, ob- 
jects must produce spatial location. TPE 
reaction to the “relational system” that 
has been indicated above predicts that 
we move into our perceptual products ie 
certain invariant coordinations. The 4% 
velopment of the subjective coordinat? 
system belongs among the most inter 
esting chapters of the psychology 
perception. The whole field of per, 
tion possesses the same validities 2° je 
have established through the n 
thing.” A “bare object” represen” s- 
peculiar if not limited relation’ yal 
tem. The localization of the perc! ppe 
objects occurs in connection Wit 
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three orthogonally constructed interact- 
ing planes, the vertical, horizontal, and 
sagittal. The medium line of the per- 
ceptual frames is assigned to these co- 
ordinations with amazing accuracy. The 
phenomenal local signs, “right,” “left,” 
before,” “behind,” etc., apply to this 
system. The picture which we experi- 
ence to our right remains “the same” 
When we turn our eyes and locate it to 
the left.” By so doing, we have joined 
the time dimension to the coordinate 
system. The example of the picture 
demonstrates the invariance of the ob- 
ject in time. 

Once more: Because the precise and 
clearly experienced external world sup- 
poses a mediating relational system in 
organized detail, its concreteness and 
stability are completely lost if we ex- 
perimentally reduce the optical-acousti- 
cal-haptical stimulus pattern to a mini- 
mum. In experiments with movement 
perception (13) such loss can be 
demonstrated. If the subject, for in- 
stance, is told to estimate the speed of 
an evenly illuminated paper square 
which moves sideways with uniform 
speed in the frontal plane, one thereby 
finds considerable variance between the 
successive estimations when the experi- 
ment is carried out in a completely dark 
and “reference-poor” environment for 
the motion. If more richly organized 
reference-criteria are admitted (normal 
daylight, “passing things,” structured 
background, and the like), one immedi- 
ately judges with greater accuracy an 
with decrease in the variance of the ob- 
servations. The largest overestimations 
of the speed are elicited by the most 
reduced configurations. One lone clue, 
a stable light source in a dark room, is 
often enough to bring about a consider- 
able decline in the errors (about 15%) 
and the variance (about 5%). 

s Through another experiment at, the 
Se logical Institute of the Univer- 
ity of Helsinki, it was found that the 


subject succeeded essentially in a “Jo- 
calization task” with sound sources in 
a dark room if he could use a “reference 
sound source” which was attached to a 
slightly movable long pole between the 
subject and the “goal.” The subject 
himself could “regulate” this sound- 
source (buzzer). Most effective was 
the “acoustical-optical method,” where- 
by the pointer was also illuminated 
(14). 

Of all the phenomena which produce 
a permanence in the frame of the space- 
time-coordinate system, our empirical 
self is probably the most important. 
We notice most distinctly the temporary 
interference and functional crises of the 
percepeual-psychological system in the 
not-too-rare cases where we wake up in 
unknown places, €g. in darkness. All 
conceivable clues for the perceptual 
function seem to be absent—even the 
possible pressure experiences and kines- 
thetic impressions which are recalled by 
our position in bed. And nevertheless 
self-permanence, which we doubt least, 
shapes the perceptual content—apart 
from some pathological conditions. AC- 
cording to Gadelius (2), the empirical 
self will, through this temporal invari- 
ance, be constituted as a perception 
phenomenon with the perceptual com- 
ponents of the body-image. It is one 
of the highest acquisitions of percep- 
tion. Psychology has doubtless shown 
that the invariance of the self-experi- 
ence depends on conditions similar to 
the conditions responsible for the per- 
ception of objects. Kaila observes in 
this connection: 


The so-called “phantom limbs” of the am- 
putated part of the body show what tenacious 
constancy and hallucinatory intensity this 
body-scheme has: a war invalid for instance, 
the “one who has lost his leg, is initially ex- 
posed to danger because he experiences hal- 
lucinations of his lost leg, 50 that when he 
gets up out of bed, for example, he makes a 
step with the non-existing leg (5, P- 164). 
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With the most trivial and common- 
place perceptual systems—the phenome- 
nal objects—as a starting point, we have 
proceeded to the most complex thing 
that a human being is able to perceive 
at all: his empirical self. The main 
conclusion to be drawn from our dis- 
cussion is obvious: It is the stimulus 
pattern as a system of relations, not as 
representing some “substantial stuff,” 
which immediately gives rise to the im- 
pression of a three-dimensional world. 

In summary, I believe the position 
represented here paves the way for 
modern perceptual psychology to arrive 
at functional principles of explanation. 
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In 1904 Herbert Sidney Langfeld re- 
signed his position. as Secretary to the 
Naval Attaché of the American Embassy 
in Berlin, for he had decided against a 
career in the foreign service in favor of 
graduate study in psychology at the 
University of Berlin. This decision 
brought a real loss to American foreign 
service, for in the very best sense Lang- 
feld was a diplomat to the manner born; 
but at the same time it was an impor- 
tant gain for American education in gen- 
eral and psychology in particular. Lang- 
feld was the last American to study 
under the first generation of German 
psychologists, and revealed during his 
long career in this country, both as 
teacher and administrator, a serene wis- 
dom and detachment together with a 
breadth of outlook which may have re- 
sulted in part from his experiences in 
the German universities. The psycholo- 
gists with whom he studied in Germany 
were also philosophers with imposing 
ideas and a sweep of imagination which 
often made up for their lack of knowl- 
edge regarding the experimental tech- 
niques and statistical analyses in which 
our students today are so carefully 
trained. 

Dr. Langfeld received his early educa- 
tion at the Central High School of his 
native Philadelphia and at Haverfor 
College. After he enrolled in the Uni- 
versity of Berlin he did most of his work 
under Carl Stumpf and received his 
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Ph.D. under him in 1909. He published 
several articles in the Zeitschrift fiir 
Psychologie in the field of visual phe- 
nomena, rather than in Stumpf’s favor- 
ite field of audition. At the same time 
he showed a profound interest in art 
which remained with him throughout his 
life, and spent many hours in museums 


-and in the classrooms of art historians 


and aestheticians. On returning to this 
country he accepted a position as in- 
structor in psychology in Harvard Col- 
lege. When he left Emerson Hall fif- 
teen years later he had served Harvard 
as associate professor and director of 
the laboratory for a number of years, 
had written articles on a variety of 
topics both within and outside the field 
of experimental psychology, and in 1920 
had published his most distinguished 
and best-known book, The Aesthetic At- 
titude. He had also written, in col- 
laboration with Floyd Allport, a labora- 
tory manual which was widely used in 
various universities for a number of 
years. 

Langfeld was a good lecturer, skilled 
in clear exposition and in the use of 
language “ynderstanded of the people.” 
He was not dramatic, nor did he ever 
lose his natural dignity by trying to be 
popular or obvious. The duller students 
were not impressed, but he had the 
teacher’s reward of knowing that those 
who took the pains to follow and grasp 
his ideas retaine an enduring and fa- 
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vorable impression. Tucked away in his 
desk Mrs. Langfeld recently found a 
petition signed by a large number of 
Harvard graduate students begging him 
to offer a special course in systems of 
psychology because of their conviction 
that he could give them a scope and 
depth of insight which they felt they 
badly needed. In a recent volume, Col- 
lege in a Yard, one of the many dis- 
tinguished contributors who recalled 
their memories of famous teachers at 
Harvard wrote that “In psychology and 
aesthetics there was Herbert Sidney 
Langfeld, whose occasional ironies were 
as nice, as elegant and impeccable as his 
habit.” 
In 1924, on the completion of Eno 
Hall, Langfeld accepted a call to be- 
come Professor of Psychology at Prince- 
ton and to succeed Howard Crosby 
Warren as Director of the Psychological 
Laboratory. Warren had managed to 
separate psychology from philosophy 
only three years before, with the result 
that he and Langfeld were burdened 
with the heavy responsibility of building 
up a department faced with indifference, 
if not opposition, from the philosophers 
and the administration towards what 
William James once called “that nasty 
little subject.” In 1932 Langfeld be- 
came Chairman of the Department and 
in 1937 was awarded the Stuart Pro- 
fessorship. During the years until his 
retirement in 1947 the department more 
than doubled in size of staff and in the 
scope of its activities. Langfeld be- 
came deeply devoted both to the town 
and gown of Princeton, and resented any 
criticism of the institution or the de- 
partment almost as though it were a 
personal affront. On rare occasions he 
allowed himself the privilege of speak- 
ing with disfavor about minor matters 
relating to Princeton, but it was not 
good form for anyone else to do so in 
his presence. He was admired by grad- 
uate students and members of his staff, 
for they accepted his pride in their work 
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as a challenge to their best efforts. They 
always referred to him affectionately as 
“Langie,” but not when he was within 
earshot. When Wever and Bray made 
themselves famous by their studies in 
audition, he experienced the satisfaction 
and delight of one who was somehow 
sure that only in his laboratory could 
such important ideas be born. 

Throughout his writings there appear 
numerous variations on the theme of a 
motor theory of consciousness, or, as he 
preferred to call it, a response psy- 
chology. This view has always been 
closely associated with his name, al- 
though in one of his articles he dis- 
claimed any credit for formulating the 
theory and showed in impressive detail 
its origins and development from Greek 
Philosophy down to the immediate pred- 
ecessors of behaviorism, Undergradu- 
ates both at Harvard and Princeton 
were always warned by their tutors and 
Preceptors that they should know the 
motor theory forward and backward, 
for no matter how varied the wording of 
an examination question, there could be 
only one correct answer. Langfeld was 
never able to believe, as did some of his 
German Professors, that consciousness 
can be isolated for phenomenological 
study in its own right. It is too closely 
linked to efferent as well as to afferent 
events. The data of sense, as well as 
those of Perception, feeling, cognition, 
and emotion have their terms interpene- 
trated and closely bound up with motor 
processes. 

This view of mind, with its emphasis 
on motor response, led into and was 
then swamped by the behaviorism of 
the 1920s. With radical behaviorism, 
however, Langfeld had no sympathy ; 
and it is gratifying to realize that he 
lived long enough to see his own view; 
albeit with new and more elaborate 
terminology, restored to its proper and 
appointed place in the psychology of 
today. He was often amused by the 
tortuous labor of those who had not 
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fully recovered from the behavioristic 
revolt to find “objective” sentences for 
the description of what at one time 
would have been called simply direct 
experjence or consciousness. He knew 
that consciousness by any other name 
would still be the same thing, and he 
continued to insist that the data of 
classical psychology are an inescapable 
part of the intricate chain of external 
and internal events that make up the 
totality of mental life, and that these 
events are best looked upon and labeled 
as neutral rather than either subjective 
or objective. His refusal to become in- 
volved in any sort of body-mind con- 
troversy was undoubtedly reinforced by 
his close association with the philoso- 
pher-psychologist, Edwin B. Holt, who 
frequently argued that attempts to tell 
what is objective and what is subjective 
are a waste of time for the simple reason 
that they never get anywhere. 

In his writings on aesthetics, and 
especially in Tke Aesthetic Attitude, 
Langfeld drew heavily on the doctrine 
of empathy for an explanation of the 
very special and affective perceptual 
qualities of art. Hardly anywhere in 
English can a more persuasive account 
be found of the manner in which in- 
cipient muscular movement may con- 
ceivably be the basis of the judgment 
that a work of art is exciting, wistful, 
melancholy, sprightly, or what not. The 
problem is among the oldest in specula- 
tion about art and has given no en of 
trouble to philosophers, art critics, and 
psychologists. It seemed absurd to 
argue that the perceived object of art 
itself could be melancholy; and there 
was good evidence to show that those 
who described a work of art in such 
manner were not ordinarily in a melan- 
choly state. Here was a problem, for 
if neither the object nor the subject is 
melancholy, where does the quality come 
from? Langfeld seized upon Lipp’s doc- 
trine of Einfühlung and worked it for 
all it was worth. Aesthetic qualities 


must somehow have their origin in the 
subject, but are immediately and un- 
consciously projected upon and identi- 
fied with the object, so that it is the 
object which seems to be melancholy. 
The notion is an obvious variation on 
the theme of a motor theory of percep- 
tion, and was therefore doubly congenial 
to Langfeld because of its close relation 
to his more general point of view in 
psychology. 

In recent years, however, the theory 
of empathy has been severely attacked, 
and Langfeld was placed in a much 
more defensive position with respect to 
many of the arguments to which he was 
deeply attached. The implications of 
Gestalttheorie for art, especially in some 
of the devastating paragraphs of Koffka 
and Kohler, were drawn upon by a num- 
ber of philosophers and psychologists to 
reveal the serious limitations of empathy. 
The doctrine gives too little credit to 
the genius of the artist. It implies that 
the subject creates from within himself 
the glories of the glass of Chartres, the 
majestic power of the sculpture of Mi- 
chelangelo, and the wistful melancholy 
of some of the minuets of Mozart. The 
artist merely puts together a sort of 
neutral perceptual design and then waits 
for the subject to perform the act of 
artistic creation. Because this act seems 
too much to hope for, the pendulum has 
swung in the direction of assigning to 
the artist the ability to create configura- 
tions which themselves possess the żer- 
tiary qualities of longing, serenity, erotic 
passion, sadness, gaiety, etc. Sadness 
may be present in visual and auditory 
as well as in organic impressions. It is 
this possibility of multiple location that 
has caused the confusion in aesthetic 
theory. A person may feel sad. A piece 
of music may sound sad. But neither 


state necessarily depends on the other. 
It is not empathy which gives to the 
slow movement of the Eroica the sense 
of poignant grief. It was Beethoven 
who performed that miracle, and for 
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those with ears to hear, it is a source of 
pleasure rather than of grief, although 
the judgment may be made that the 
music is profoundly sad. Yet the doc- 
trine of empathy is still widely held, 
and may in time recover from its 
wounds. There are few views in philos- 
ophy and psychology which are com- 
pletely down and out. In any case, 
Langfeld’s treatment of Einfühlung is 
masterly and for a long time will remain 
required reading for every student of 
aesthetics, for even if a student enter- 
tains doubts about empathy he must 
know how to answer the penetrating 
arguments which Langfeld has marshaled 
in its support. 

Langfeld’s eclectic interests in psy- 
chology and his respect for good writing 
made him an ideal editor. As the years 
went on, he spent an ever increasing 
amount of time in going over manu- 
scripts with painstaking care for profes- 
sional journals and for the publishing 
house of John Wiley and Sons. In 1931 
he was appointed editor of the Psycho- 
logical Monographs, and then for thir- 
teen years served the Psychological Re- 
view in a similar capacity. He was joint 
author and editor of a number of texts, 
the best known of which was the famous 
BLW, a factual foundation for psy- 
chology which he, Boring, and Weld 
prepared for such advanced introduc- 
tory courses that many graduate stu- 
dents have used it to bone up for their 
final oral examinations. His services to 
psychology have been recognized in a 
variety of ways. He was an honorary 

member of the Belgian and Swedish 
Psychological Associations, and in 1930 
was elected president of the American 
Psychological Association. His election 
to the Pontifical Academy of Sciences in 
1949 and the private audience with His 
Holiness Pope Pius XII gave him, an 
Episcopalian, deep pleasure and satis- 
faction, as did also the award of Docteur 
de l'Université which he, Michotte, and 
Piéron received from the University of 


Montreal at the time of the 14th Inter- 
national Congress of Psychology. 

Throughout his long professional ca- 
reer Langfeld provided a valuable link 
between psychology in America and psy- 
chology in other countries. He was 
Chairman of the Committee on Inter- 
national Cooperation in Psychology of 
the National Research Council, and 
Permanent Secretary of the Interna- 
tional Congress of Psychology, and in 
his many travels abroad he established 
close personal and professional relations 
with investigators in other countries. It 
is safe to say that no American main- 
tained such friendly contacts with so 
many psychologists in other lands. 
Whenever his professional friends had a 
chance to come to this country, they 
hastened to accept Langfeld’s invitation 
to visit him in Eno Hall and to partake 
of the hospitality so generously offered 
by Dr. and Mrs. Langfeld in their 
Princeton home. 

Retirement made little change in the 
toutine of Langfeld’s life except for the 
freedom from classroom instruction and 
administrative responsibilities, He came 
regularly to his office in Eno Hall, carried 
on voluminous correspondence regarding 
professional matters all over the world, 
wrote articles and prepared lectures to 
be delivered before learned societies in 
this country and abroad, watched with 
deep interest but without any interfer- 
ence the activities of the department 
over which he presided for so many 
years, and continued to extend the hos- 
Pitality of his home to graduate stu- 
dents, members of the staff, and espe- 
cially to visitors from abroad. Shortly 
before his death he spoke informally at 
one of the regular seminars in Eno Hall. 
He left an indelible impression both 
with staff and students of his ability to 
expound clearly the broad historical 
background for his sharp comments 
about contemporary psychology. 
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PSYCHOLOGICAL STRUCTURE AND PSYCHOLOGICAL 
ACTIVITY? 


HELEN PEAK 
University of Michigan 


The science of behavior, like all 
science, is preoccupied with relations 
between events. For this reason, it 
is important to analyze the diverse 
relational concepts that have been 
employed in describing psychological 
structures, in order to discover 
whether unique characteristics belong 
to concepts such as similarity, opposi- 
tion, complements, antecedents, and 
consequents, or whether they are all 
reducible to simpler and more funda- 
mental relational properties. Another 
paper (28) has suggested that all kinds 
of psychological structures can be 
described in terms of parts or events 
standing in spatial and temporal rela- 
tions to each other, the space being 
conceived in psychological terms; and 
that every arrangement implies the 
course that activity will take when the 
Structure is activated at specified 
Points. It will be the task of the 
Present paper to elaborate this analy- 
sis in somewhat more detail and to 
suggest how selected structural pat- 
terns influence the course of events in 
Perceiving and acting. 

The properties of structure will be 
described and certain basic principles 
about the relation of structure an 
activation will be proposed. These 
will be discussed as they apply to 
Phenomena such as context effects, 
assimilation and contrast, the alge- 

raic summation of positive and nega- 
tive attitudes to produce intermediate 
attitudes, judgment within “bal- 
anced” and “unbalanced” structures 
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as described by Heider (11), and the 
persisting activity of the motivated 


state. 


PSYCHOLOGICAL STRUCTURE 


If the course of psychological events 
is to be predicted from information 
about the activation pattern imposed 
on a known structure, the meaning of 
the central concepts involved must be 
made clear, most particularly the 
notions of structure and activation. 

Stated briefly, activation is a term 
similar in meaning to stimulation but 
more general in the sense that the 
change which activates a structure may 
be either the energy change which 
takes effect by way of sense organs (a 
stimulus) or a central event, such as 
an aroused concept, which in turn 
produces change in or activation of 
another structure. Activation is 
transmitted change. 

The term structure is used in a very 
broad sense to refer to a system of 
relationships between identifiable 
parts. Itisa hypothetical construct? 
to be inferred from events observed 
under specified conditions and, like all 
such constructs, once inferred, it be- 
comes the basis for predicting be- 
havior on subsequent occasions when 
there are reasons to believe that the 


a less restricted meaning in 
given it by Meehl and Mc- 
‘The ideal is, of course, to 
usly both at the 
rational level in 
will be inferred 
the various events to which it is assumed 
frofontribute. But the degree of rigor of the 
ot determine whether it is a 


definition does ni n 
construct or its hypothetical character. 


2 The term has 
this paper than is 
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system of relations has not changed. 
For example, if it is ascertained that 
under free association instructions the 
stimulus word black elicits the re- 
sponse white 80 percent of the time in 
some population or in some individual 
on a series of occasions, then it is 
inferred that there is a relationship 
between the stimulus-aroused concept 
and another part of the structure 
whose activation produces the ob- 
served response. This familiar fact 
that organisms retain dispositions to 
react in certain ways must imply that 
the underlying structural arrange- 
ments have a basis in the nervous 
system. However, it is not our pur- 
pose to present a neurological model. 
Rather the intent is to describe certain 
patterns of relations which are com- 
patible with important psychological 
phenomena. The neurological sub- 
strate of these phenomena may or 
may not possess characteristics of the 
kind to be described. 
Now the components of a structure 
will vary not only with the subject 
matter of the particular science but 
also with the theoretical preferences 
of the scientist. But parts of some 
kind must be identified, whether they 
be called stimuli, roles, concepts, 
regions, R,, attitudes, cathected ob- 
jects, or drives—to mention a few 
miscellaneous constructs that have 
been regarded as segments of the 
matrix of events by equally miscel- 
laneous psychologists. Moreover, it 
should be noted that the components 
of a structure are not to be regarded 
as indivisible elements, for some com- 
ponents may be highly structured with 
relations existing between the parts, 
as in the complex concept of a friend, 
while others are relatively simple and 
homogeneous, as would be the concept 
of warmth. This means that there 
may be relations within components 
parts as well as between parts of a 
larger structure. 
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It has already been noted that the 
nature of the relations existing be- 
tween the parts of a structure will be 
a principal concern of this discussion 
and that the effort will be made to 
conceptualize the more complex rela- 
tions in terms of the concepts of posi- 
tion and distance. Therefore, these 
basic relational notions must now be 
defined. 


Position 


To assign position means, generally 
speaking, to place an event or a point 
in relation to one or more other events. 
In the simplest case, position depends 
on the process of identifying and 
distinguishing an event from other 
events, and to have the same position 
means simply to be in the same 
category. 

But position may be specified more 
precisely if events have been ordered 
on a continuum or arranged in a series 
which derives its order either from the 
amount of a property or from the tem- 
poral order of activation of the in- 
dividual parts of the series when one 
part is activated. In other words, 
position may be stated in terms of 
membership in a category (nominal 
position) or in terms of some amount 
of a property or position in a series 
(ordinal position, or a point on an 
interval or ratio scale). 


Distance 


The psychological distance from 
any point A to point B is defined in 
terms of the number of units or steps 
in an ordered series of some kind 
which intervene between A and B 
which are themselves part of the 
series. 

If, then, two points have only 
nominal position in separate cate- 
gories, their distance apart will be 
indeterminate unless the categories 
themselves can somehow be ordered: 
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In other words, the description of 
a distance must involve finding a 
method of ordering events before it 
can be established (a) that other 
points of a series do in fact lie between 
A and B, and (b) that there is some 
determinate number of such inter- 
vening events that define,the distance. 
Further, it must be noted that 
psychological distance has a special 
Property which makes it different in 
this respect from physical and mathe- 
matical distance: it need not be sym- 
metrical. The distance from AtoB 
ee necessarily the same as from 
ato A. For example, the occurrence 
of an event which is an antecedent 
will more often elicit the consequent 
as an associate than presentation of 
the consequent will elicit the ante- 
Cerent _It is inferred that the 
: istance is greater from consequent to 
antecedent than in the other direction. 
Although this is not the appropriate 
Place for a discussion of all the opera- 
tions for determining order and dis- 
ee among psychological events, a 
w examples of operations from which 
Psychological distance has been in- 
ferred will make it clear that all imply 
the foregoing definition. Distance 
between two points may be inferred 
rom judgments of the position of each 
Point on a scale. The subject matches 
pa reaction to the stimulus objects to 
Cae scale position and the difference 
rel eh represents the number © 
b €rvening units. Or distance may 
i judged directly where the subject 
io ked to state how far apart two 
imuli are. For example, he judges 
sae different John and Jim are in 
= clligence, and reportsit. The units 
i in the judgments and their dis- 
a ap apart may or may not have been 
dene by techniques such as those 
eloped by Thurstone (36), Coombs 

t! ), Guttman (35) and others but af 
ese methods imply the occurrence 


and identifiability of intervening steps 
of some kind. Furthermore, the rela- 
tive frequency and the latency with 
which one event follows another have 
also been considered to reflect distance 
between events. Indeed, probability 
of activation of one event by another 
is often taken as synonymous with 
distance (26). 

If it be assumed that each of these 
measures reflects in some degree a 
distance between A‘ and A” in psy- 
chological space, the major opera- 
tional problem is to find ways of 
removing the effects on behavior of 
determinants other than distance 
A’-A", for it is only when these irrel- 
evant determinants have been par- 
tialled out of each operation that the 
various operations for inferring this 
distance can be made comparable.’ 


Direction 
Direction involves determining 
position of some event with respect to 


two or more other positions. 
The course of activation can be said 


to have direction, but there is a ques- 
tion whether direction is a necessary 
concept for the description of struc- 
ture. Suppose that I know three 
boys, Tom, John, and Jim, whose IQs 
are 100, 110, and 115. It is possible 
to say that John’s IQ lies in the 
direction of Jim’s from Tom’s, and 
that Jim's and Tom’s are in opposite 
directions from John’s, in my psycho- 
logical structure. But this descrip- 
tion in terms of direction adds little 
toastatement in terms of position and 
distance. It is when predicting the 
course of activity within a structure 
that the concept of direction becomes 


d Marquis (12, p. 135) on 
the low correlation between indicants of habit 
strength, such as the probability of activation 
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useful, for it is necessary that in- 
formation about the points of input 
into a structure be provided, in addi- 
tion to knowledge of the structure 
itself, before the direction of events 
can be determined. It is the relation 
of subsequent events to the initial 
point of activation or the input point 
that gives direction. 


` ACTIVATION 


To say that a structure has been 
activated means that change from 
some source has set off change in a 
structure. Thus the energy change 
that is a stimulus event, such as the 
spoken word good, initiates a succes- 
sive wave of change in parts of a 
complex structure finite distances 
apart until the process leads finally by 
a series of activations to the spoken 
word bad. Potentiality for this series 
of events implicit in the structural 
arrangement has been actualized by 
the transmittal of change from one 
point in the structure to other points. 

In some cases activation produces 
only a temporary disruption of the 
structure which tends to be recon- 
stituted following activation. Thusa 
violin string is distorted when a bow 
is drawn across it but soon returns to 
its original state. In a somewhat 
different way, neurone structure is 
partially destroyed in responding to 
stimulation but is polarized again after 
the passage of an impulse. Again, the 
activation of a series of concepts, per- 
cepts, movements, or word responses 
involves structural change, but here, 
too, activity subsides and the original 

state of quiescence returns,‘ 

But activation may leave an altered 

structure, which means that the struc- 
ture does not invariably return to the 


‘The disruptive events of activation and 
the opposed tendency to reconstitute structure 
suggest entropy principles, activation corre- 
sponding to positive entropy and reconstitu- 
tion of structure to negative entropy (33). 


preactivation state. This is known 
to be the case when later activation 
at the same point, under conditions 
otherwise the same, produces a differ- 
entoutcome. Restructuring or learn- 
ing has occurred. The presentation 
of a complete learning theory is be- 
yond the scope of the present discus- 
sion, which addresses itself primarily 
to the problems of the immediate 
effects on perceiving and acting of 
activating a given structure at a given 
point or points. However, it may be 
said that any condition which has the 
effect of activating two structures 
close together in time increases the 
probability of a change in the relation- 
ship between the structures, Fur- 
thermore, the shorter the existing 
distance between the structures when 
activated, the more likely it is that 
their simultaneous activation will 
reduce the distance still further, The 
role of affective consequences in learn- 
ing cannot be considered at this time. 


ACTIVATION PRINCIPLES 


Structure is of interest only insofar 
as it determines the course of activity. 
The next step is, therefore, to analyze 
certain assumptions that must be 
made about the relation of activation 
and structure. Five principles are 
Proposed that appear to be necessary 
in order to derive theorems about ac- 
tivation patterns that develop within 
particular structures. The validation 
of these constructs will depend upon 
the success with which they generate 
Propositions which are verifiable. At 
present they derive their justification 
from the fact that they appear to be 
compatible with certain phenomena 
for which there is observation@ 


support. 
Distance and Activation 


It is assumed that the probability 
of activation of some point B when 
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is activated is an inverse monotonic 
function of the psychological distance 
between A and B, when other sources 
of the activation of B are excluded 
(Principle I). 

As already explained, the activation 
of any point in psychological space 
may be determined by input from 
more than one source, and it is only 
when activation of B from other 
sources is controlled that this relation 
can be assumed to exist between dis- 
tance A-B and the probability of 
activation of B by A. 

As a matter of fact, it must be the 
exceptional occasion when input to 
any point in a complex structure 
comes from only one source, and so it 
is necessary to make assumptions 
about the way in which activation 
from different sources combines to 
determine the next steps in the activa- 
tion pattern. There appear to be two 
basic patterns which demand different 
combination principles. In the first, 
a serial arrangement, activation of 
some structure at the end of the series 
depends on the occurrence of activa- 
tion in preceding steps of the series. 
In the second pattern, structures are 
arranged so that there is multiple 
input to the same point from two Or 
more sources. 


Serial Arrangement 


Ina serially arranged structure, the 
probability of activation of some 
point C at the end of the series when a 
point S is activated is the product of 
the probabilities of activating points 
intervening between S and C (Prin- 
ciple II). 

For example, in Fig. 1 the prob- 
ability that Sa will activate C equals 
0.63 (0.9 X 0.7). The probability 
that Sẹ will activate C equals 0.36 
(0.9 X 0.4). 
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Multiple Input to the Same Point 


The probability that any point will 
be activated is a function of the com- 
bined probabilities of its activation 
from all currently active structures. 
Given the probability that A will 
activate C and that B will activate C, 
the probability that A and B together 
will activate C is assumed to be equal 
to: 

1 — (Poa X Pcs) 


where Pca is the probability that A 
alone will ot activate C and Pcs is 
the probability that B alone will not 
activate C (Principle III). 

For the values given in Fig. 1, the 
probability that A and B together 
will activate C is equal to 0.77; that is, 
the combined probability is greater 
than the probability produced by 
either component (A or B) except 
when the probability of one or both 
terms is zero or unity. 

The logic of the assumption is 
simply this. The probability that C 
will occur when both A and B are 
activated is the sum of those occasions 
where A activates C but B does not 
(Pa X Ps), where B alone activates 
C but A does not (Pa X Pg) and 
where both activate C (Pa X PB). 
In other words, the relative frequency 
is reduced by the frequency of those 
occasions on which neither A nor B 
activates C (probability that C not 
activated = Poa X Pcs). This is 
the term subtracted from unity in the 
equation. 


Fic. 1. Serial arrangement of parts: 
S.A, A> C; > B, B—C. Multiple 
input to the same point: A —> Cand B => G 
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Examples of situations in which 
complex multiple input may be as- 
sumed to occur are numerous. The 
many studies which emphasize the 
effect of preparatory events, such as 
context, set, instructions, and per- 
ceptual response dispositions on reac- 
tions to stimuli, belong here. Howes 
and Osgood (15) showed, for example, 
that both the number of preparatory 
stimuli similar to a word and the 
interval by which those preparatory 
stimuli preceded a stimulus word in- 
fluenced the nature of the free associa- 
tion response made to the word. The 
preparatory stimuli may be assumed 
to have activated the same structure 
as did the stimulus word, and thereby 
to have increased the probability of 
the occurrence of a response depend- 
ing on activity in that structure. 
And Asch’s well-known demonstration 
(1) of the effects of prior presentation 
of trait terms, such as warm and cold 
on the interpretation of other traits 
attributed to an individual, may 
illustrate the same point. 

It appears that the influence of 
central motivational states on percep- 
tion and action may likewise be con- 
ceived in terms of multiple input. 
Many examples come to mind, such as 
the familiar studies of Atkinson and 
McClelland (2), in which TAT stories 
were written under different degrees 
of hunger motivation, with the con- 
sequence that references to eating 
and/or activities instrumental to get- 
ting food increased with increased 
time of deprivation. 

Again, the apparent increase in the 
breadth of the generalization gradient 
when drive level is high or with high 
anxiety may also involve this same 
principle, if it is assumed that anxiety 
or high levels of motivation set up 
diffuse activation and hence multiple 
input to all points on a gradient. 
Finally, it should be noted that the 
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process of abstraction should also be 
examined with this principle in mind. 
A property common to different 
situations is more likely to be noticed 
if an individual experiences a number 
of different events all of which possess 
the property. Thus the European 
who sees ten Americans chewing gum 
is more likely to abstract a concept of 
gum-chewing Americans than if he has 
only one such experience. The mul- 
tiple input to the gum-chewing prop- 
erty increases its probability of 
arousal. 


Event Units 


Bridgman has aptly described the 
problem of setting boundaries to 
physical events when he says, “I be- 
lieve the essence [of the identification 


of an event] lies in the discrete 
character of the event. . . , An event 


is by definition the aggregate of all 
phenomena bounded by certain discon- 
tinuities.” He then goes on to point 
out that this concept can only be an 
approximate one, because there are no 
such things in experience as sharp 
discontinuities. Rather, as measures 
become more refined “the edges of the 
supposed discontinuities become blur- 
red” (5, pp. 95-96). 

This is the problem of discovering 
cleavages in the matrix of events, the 
boundaries which result in some 
events being part of the same whole 
while others are parts of different 
wholes. This search for gaps between 
phenomena is involved in delimiting 
the boundaries of perceived objects or 
of concepts, in determining what is 
included in a trait complex and what 
is excluded, in bringing it about that 
events will be “assimilated” or ‘‘con- 
trasted.” 

It is proposed that the discon- 
tinuities that establish boundaries in 
psychological space are of two kinds: 
(a) the distance between events: 
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decreasing distance tending to increase 
the likelihood that events will be part 
of the same whole or within the same 
boundaries; and (b) the probability of 
simultaneous arousal, the likelihood 
of events being included in the same 
whole increasing with the probability 
of their simultaneous arousal (Prin- 
ciple IV). 

_ This means that anything which 
increases the probability of activating 
either or both of two points will in- 
crease the probability of their occur- 
ring together, for the higher the 
probability of their occurrence, the 
more likely they are to occur together. 


Duration of Activity and Action Dec- 
rement 


The repeated activation of the same 
structure at short intervals leads to 
activity decrement (Principle V). 

The spread of activation will depend 
not only on distances between the 
parts of a structure and the pattern 
which determines the occurrence of 
multiple input at certain points, but 
it will also be influenced by the state 
of the to-be-activated structure. 
There is considerable evidence to in- 
dicate that the repeated activation of 
the same structure tends to produce @ 
temporary reduction in activity; that 
is, activity results in some sort O 
change in the state of the system SO 
that it is less reactive. This is what 
Lewin called satiation (21) and Hull, 
reactive inhibition (16). Köhler has 
provided evidence of figural after- 
effects which points to 4 similar 
Process (19). And more recently 
Walker and his associates have also 
Confirmed the existence of adecrement 
in activity as a result of activity (37)- 

It would appear in general that the 
same conditions which result in de- 
veloping clear foci of activity and 
sharp boundaries (IV) will tend to 
speed the onset of decrement. This 


suggests that short distances between 
the parts of a structure should lead 
to decrement, provided either that the 
parts are interrelated in such a way 
that they activate each other, or that 
input to the parts from some external 
source continues so that parts are 
activated repeatedly and simultane- 
ously. However, the persistence of 
activity within a structure is not 
simply determined, for two conse- 
quences of distance between parts 
work in opposed directions. On the 
one hand, repeated activity at the 
same point is more likely to occur at 
short distances between parts, and 
hence to produce decrement. But it 
is also true, on the other hand, that 
great distances may reduce the prob- 
ability of activation so much that 
activation ceases. The condition of 
maximum duration of activity is to be 
expected, therefore, at some distance 
between parts which is great enough 
to prevent immediate onset of decre- 
ment and yet small enough to provide 
relatively high probability of con- 
tinuing transmission of activation 
from one part of the structure to 
another, under the existing conditions 
of input to the structure. 

The application of this principle 
suggests that a simple stimulus object 
like the conventionalized drawing of 
a face would yield not only a simpler 
and more circumscribed activation 
pattern than will the actual presence 
of a person, but that the former 
stimulus should result in a more 
prompt perceptual act and a more 
rapid onset of decrement, so that the 
perceptual act should drop from 
“attention”? more promptly. 


Tue DESCRIPTION OF SELECTED 
STRUCTURES 
We turn now to the task of describ- 
ing selected structures that are implied 
by familiar relational concepts such as 
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similarity, opposites, complements, 
antecedents, and consequents. These 
classes of relationship are not sharply 
delimited nor have they always been 
defined in the manner here proposed. 
We believe, however, that the struc- 
tures to be described are common 
ones, and that the particular terms 
used as labels are not without prece- 
dent. As already indicated, it is our 
intention to describe these patterns in 
terms of the concepts of position and 
distance, as previously defined. And 
in each case, the question will be: 
How does this arrangement affect the 
course of activation when the struc- 
ture is aroused at different points? 
Obviously, a very large number of 
structural arrangements and of input 
patterns may be conceived. It has 
been necessary, therefore, to limit the 
task to a description of the structures 
mentioned above and to a considera- 
tion of the course of activity in such 
structures when certain simple input 
patterns are operating: activation at 
one point; at two points different dis- 
tances apart, resulting in such out- 
comes as assimilation, contrast, or 
algebraic summation of opposites; and 
at three points, which will produce 
resultants of the kind that Heider has 
called balanced and unbalanced (11). 


Similarity Structures 


It has been common practice to 


define similarity conceptually in terms 
of the sharing of common properties. 
Sometimes this Means that similarity 
between two events is described in 
terms of their distance apart on the 
gees pes Again, it may refer 
to the number o continua OF prop: 
erties) that the ey, for prap 


events have in 
common (6). Obviously some com- 
bination of 


n of the number of shared 
properties and the amount of each 
such property involved in the com- 
plexes would provide a more sensitive 
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index of similarity than either alone. 
However, we shall consider at this 
time the simple conception of sımı- 
larity defined in terms of distance 
between points on a continuum, for 
the more complex structure raises 
other problems, such as the organiza- 
tion of the separate properties within 
the complex, which must be given 
more detailed elaboration elsewhere. 

To say that a common property 1s 
shared by two events means that the 
events lie on the same continuum. 
And in the words of Poincaré“ . . a 
system of elements will form a con- 
tinuum if we can pass from any one of 
them to any other by a series of con- 
secutive elements such that each is 
indistinguishable from the preceding” 
(30, p. 52). In such a series some 
Position A is indistinguishable from B 
and B from C, although A is not equal 
to C. 

It is assumed that distances are 
short within a continuum, so that the 
probability of activation at any point 
in the continuum by activity at other 
points tends to be high. However, as 
the continuum comes to include more 
distinguishable Positions, and hence 
to imply a greater number of events 
between the activation of one end and 
the transmission of the activation to 
the other end, the distance will be 
increased. 
Now it should be recognized that 
the Property represented by a con- 
tinuum may be anything at all which 
the individual has abstracted from 
his experiences and structured in this 
fashion. This means that it may be 
blueness or brightness or other prop- 
erties assigned to physical objects, 

ut it may also be any of the qualities 
that we attribute to human beings 


stich as friendliness, conservatism, 


honesty, coldness, of aggressiveness: 


any one of which may be regarded as ê 


simple property varying in amount 


| 
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and made up of a series of ordered 
positions without gaps between them.® 


Opposites 


At some stage a continuum may be- 
come so highly differentiated that the 
distance between the end points is 
greatly increased and subcategories 
develop within the more inclusive con- 
tinuum. When this has occurred, 
moving from one end of the continuum 
to the other involves passing through 
a zero point or boundary which sep- 
arates the emerging subcategories. 
Thus warm and cold within the 
dimension of temperature, or clean 
and dirty within the continuum of 
cleanliness, exemplify emerging sub- 
categories which are in contact at a 
zero point. Such subcategories are 
usually called opposites. 

The emergence of the opposites 
relation from a similarity continuum 
1s implied in studies which have re- 
vealed that this relation appears 
relatively late in the individual’s life 
history, as compared to similarity (39, 
p. 344). It is also of interest in this 
Connection that various writers, in- 
cluding Freud (9), have noted that the 
same word may serve to represent a 
Property and its opposite in ancient 
languages, such as Egyptian or Latin, 
while modern languages tend to use 
different words for these relationships. 
Latin provides examples like the 
word altus, meaning both high and 
deep, or sacer, to be translated as holy 
Or accursed. Primitive speech is sa! 
to refer rarely to opposites, though 
relations of similarity are frequent 
(38), which again suggests the deriva- 
tive nature of opposites relations. 


s that 


ê It is irrelevant far present purpose 
in fact 


the stry e mrp 
© structure of personality traits 1s } 
More complex and that simple continua are 
ame _ People may actually have such over- 
è plified beliefs about personality structure 
Ven though the psychologist regards the 


elief as inaccurate. 
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But it must be recognized that op- 
posites structures may also develop 
when a common property is abstracted 
from two initially unrelated continua 
or categories which have become as- 
sociated to form subcategories within 
the abstracted dimension. The affec- 
tive continuum, made up of positive 
and negative affect, may possibly 
emerge in this way, although the facts 
are not clear. Neurologically, posi- 
tive and negative affect appear to 
depend on separate centers (24), and 
this suggests, though it does not prove, 
that the relation between them has 
been constructed by learning rather 
than initially given. On the other 
hand, early emotional reactions have 
been said to be undifferentiated ex- 
citement with the pleasant (approach) 
and unpleasant (avoidance) patterns 
emerging at a later date (4, p. 87). 


Complements 


Whatever the origin of the pattern 
of opposition, it may be regarded as 
intermediate in its characteristics be- 
tween the similarity structure, within 
which distances are shortest, and 
those structures called ‘‘complement- 
ary,” where distances are greater. 
The latter are composed of parts 
which are not necessarily related by a 
continuous series but which are never- 
theless separated by some finite 
distance. | Moreover, together the 
parts constitute a whole or they 
produce some common consequence. 
This is to say that complements are 
associated, even though not similar in 
the sense of lying on the same con- 
tinuum and sharing a common prop- 
erty: Thus a figure and the back- 
ground without which there is no 


figure, the concepts of the role of 
lowers, of men and 


r and foll 
leader ane be thought of as palts of 


women, may be l ; € 
events whose underlying relationship 


ig usually of this kind, 
s 
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Antecedents and Consequents 


When one event is responded to as 
preceding another, the psychological 
distance between them is greater than 
that between complements. The 
magnitude of this distance is a func- 
tion of the length of the temporal 
interval between the occurrence of the 
events in past experience, as well as of 
the frequency and consistency of this 
experienced relation. Moreover, the 
distance from antecedent to conse- 
quent will typically be shorter than 
that from consequent to antecedent. 
This assumption is based on the fact 
of the relatively greater strength of 
forward than of backward associations 
(13, p. 622). 
There are many situations that pre- 
sumably involve structures of this 
kind. For example, a man and the 
acts which he performs will sometimes 
take on the antecedent-consequent 
relation. Heider (10) has presented 
interesting speculations about this 
matter, and has suggested that al- 
though these events may be a se- 
quence, the man and his acts tend to 
be treated as a whole: the acts are 
characteristics of the man rather than 
separate events. We would suggest 
that it is necessary in any particular 
case to determine the distance between 
the two events, the Probability that 
the activation of the concept man will 
elicit the concept of the act in ques- 
tion, as well as the probability that 
the activated notion of his act will 
elicit the concept of the man. As the 
distance becomes greater and the 
relation more asymmetrical, it will 
take on the properties of the ante- 
cedent-consequent relation, It is evi- 
dent that precise boundaries between 
the complementary relation and the 
antecedent-consequent relation can- 
not be established. In this, as in all 
distinctions between the structures 
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described, we are dealing with relative 
distances, so that it is simply a matter 
of convenience that leads us to apply 
different terms to relations which 
involve different relative distances. 


Abstracted Relations 


The discussion has been focused so 
far on the nature of certain concrete 
relations between events, but it is 
important to note that such relations 
may be conceptualized. Boring is 
making this distinction between a 
relation and the concept of it when he 
remarks, “A succession of ideas is not 
equivalent to an idea of succession”’ 
(3, p. 575). The idea of succession is 
an abstraction of one aspect of an 
activation pattern in which some 
event follows another. In the same 
way, similarity between two events is 
not the same as the idea of similarity. 
A simple similarity structure means 
that two points share a common prop- 
erty, in the sense of lying on the same 
continuum. But the pattern result- 
ing from activation of two such points 
on the same continuum may have 
become related to another structure 
which, when activated, is the con- 
ceptualization of similarity. An ab- 
straction is, therefore, just another 
event, resulting from the activation 
of a structure, but it is important to 
remember that distances within a 
concept of similarity-between-A-and-B 
will not necessarily be the same as the 
distance between 4 and B on the 
dimension from which this abstraction 
was derived. This does not mean 
that principles governing a certain 
relation and the concept of that rela- 
tion are different, but rather that the 
Precise structure of a concept of 2 
relation must be determined, and not 
assumed to be the same as that of the 
referent structure. 
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Complex Structures 


In order to make as clear as possible 
the essential features of the concept 
of structure as here presented, simi- 
lars, opposites, complements, ante- 
cedents, and consequents have been 
treated separately as though two 
given events were related in only one 
of these patterns at a time. The fact 
is that the different aspects of complex 
structures may have different and 
multiple relations. Thus, the con- 
cepts of leaders and followers, though 
complementary, may also share com- 
mon properties, such as having similar 
attitudes toward certain policies. 
Moreover, two complexes may share 
more than one dimension or they may 
have more than one complementary 
property. This means that the com- 
Ponent aspects of two complex events 
may be different distances apart, so 
that the distance between the com- 
plexes themselves must be a function 
of the combination of these multiple 
relationships. It is suggested that 
the commensurability of the distances 
on different dimensions depends on 
the combination of the probabilities 
of activation implied by each relation- 
ship. In other words, if complex 
structures, A and B, have positions 
Xa and X, on continuum X and posi- 
tions Y, and Y, on continuum Y, then 
the distance from A to B is a combina- 
tion of the probabilities that activa- 
tion of X, will result in activation of 
Xy and that Ya will produce activation 


of Ys, 


ACTIVATION WITHIN SPECIFIC 
STRUCTURES 


Assimilation and Contrast 

The activation principles which 
have been stated must now be applied 
in order to illustrate the kind of 
resultants that may be produced 
Within specific structures when acti- 


vated at certain points. The first 
notion to be discussed is the familiar 
one® that at short distances between 
the points of input the result will be 
assimilation, and at somewhat greater 
distances, contrast.? 

Despite the different shades of 
meaning that have been given the 
term assimilation, all uses emphasize 
similarity between processes in one 
way or another. Sometimes assimi- 
lation is known by the fact that the 
classification of events is the same; 
again, it is regarded as the incorpora- 
tion of one event by another (29, p. 6); 
or it refers to the fact that two stimuli 
produce the same response (31, p. 8); 
or events are perceived as the same 
(20, p. 95). We shall apply the term 
to that resultant of two-point activa- 
tion which is intermediate in some 
respect between the results that 
appear when either one of the points 
is activated without the other. In 
other words, assimilation has occurred 
when distance between the points is 
reduced by their simultaneous activa- 
tion, so that a tall man looks smaller 
and a small man taller if they are to- 
gether than when alone. On the 
other hand, contrast implies that the 
distance is exaggerated when two 
points are activated at the same time, 
with the consequence that the tall 
man looks taller and the small man 
smaller when they are together. 


The recent studies of Sherif, Taub, 


and Hovland provide one illustrative 


test of the hypothesis under condi- 
tions which systematically vary the 
distance between structures activated, 
in order to determine whether or not 


6 See, for example, Woodworth (39, p- 507; 
569), Koffka (17, Pp. 502), Krech and Crutch- 

20, p. 95). 
waits conditions that cannot be elab- 
orated, summation may possibly occur at the 
shortest distances. In this case either 
judged greater in the 


stimulus would be 
presence of the other than when alone. 
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is a shift from assimilation to 
p A as the distance increases (34). 
A series of weights was judged, first 
without an anchor and later following 
an anchor weight. In one condition, 
the anchor was equal to the heaviest 
of the original weights, and in another 
it was equal to the lightest. The 
judged heaviness of the weights in- 
creased when the heavier anchor 
preceded the weight being judged and 
decreased when the lighter preceded. 
This is the result that would be ex- 
pected if the perception of either of 
two events, with a small distance be- 
tween them ies between the position 
of the two judged separately. In 
another portion of this same study the 
authors showed that as the difference 
between the anchor and the weights 
of the original series was made still 
greater by an increase in the weight 
of the anchor, the judged weight of the 
original stimuli decreased. When the 
anchor was farther away from the 
weights in the ‘original series and 
lighter, the original weights were 
judged heavier. This is the common 
observation of contrast made in the 
context of adaptation level theory. 
Further evidence has been pre- 
sented by these authors suggesting 
that reactions to communications 
about social issues follow these same 
principles, for statem 
hibition which fell with; 
tances of the individ 


position “than was actually the case” 
(14). In other words, there was a 
disposition to exaggerate the distance 
between own view and that of others 
in the latter case (contrast) and to 
underestimate it in the former (as- 
similation). 

Although this kind of evi 


dence sup- 
ports the notion that distan 


ce between 


activated points is a factor in deter- 
mining whether assimilation or con- 
trast occurs, it remains to show why 
this should be so in terms of the 
assumptions made in this paper about 
distance and activation. We must 
ask what differences in the activation 
pattern would make us predict as- 
similation at short distances and 
contrast when the distances are longer. 

The theory. The essential feature 
of the theory being proposed to ac- 
count for this relationship is first of 
all the fact that at short distances 
simultaneous activation of two points 
A’ and A” by stimuli S’ and S” will 
result in multiple input to, and there- 
fore maximum Probability of arousal, 
of points lying between those posi- 
tions that would be activated by the 
or S” alone. 
nput at short 


Principle III. 
uch a pattern of activation should 


result in a judgment of 4’ or of A” 
Positions that 


or pattern of 
of activation becomes 
two points of maximum 


activation separated by 


bimodal, with 
probability of 
points with 1 
this case, the 


5 From this Point on, the phrase activation 
surface will be used to refer to a pattern of 
Probabilities of activation resulting from 
; ructure at certain points. 
The line labelled combined probabilities HR 
Fig. 2 and 3 represents the surface resulting 
from activation of the structure at points 
~ Similarly, the curves of Fig. 4 
and 5 represent activation surfaces. 


es Pei. 
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the two maxima. According to Prin- 
ciple IV, the number of events be- 
tween any two points is an inverse 
function of the probability that ad- 
jacent points will be activated simul- 
taneously and a direct function of the 
distance between the points, when 
that distance is inferred from the 
position of the two stimuli presented 
at different times, so that there is no 
multiple input. It will be shown 
that at longer distances where the 
resulting surface is bimodal but still 
determined in part by the multiple 
input to intermediate points, the 
number of events, and hence the 
distance, will be increased dispro- 
Portionately. This exaggeration of 
the number of évents intervening be- 
tween points of maximum activation 
results in contrast because the posi- 
tions of the two maximum points will 
be seen as farther apart or separated 
by a greater number of events. 
Development of activation surfaces as 
@ function of distance between points 
of input. In order that this theory 
may be made clear it is necessary to 
show how different activation sur- 
faces arise as a function of distances 
between the relative positions of input 
on some dimension. In Fig. 2, 3, 4, 
and 5, S’ and S” are stimuli which 
activate the continuum, A, at points 
A’ and A” respectively. Xs are other 
Structures related to A at various 
distances. Now Xs, like As, repre- 
sent any psychological process what- 
Soever—concepts, names, affective 
responses—that are activated with 
some probability by the arousal of A; 
they may or may not constitute a 
continuum; and they will be related 
to each other in varying degrees, 
depending on their developmental and 
activation history. However, if A is 
a continuum with the result that 
distances within A are short, and if 
distances between As and Xs are 


short, it should follow that Xs will 
have tended to become an ordered 
series connected by relatively short 
distances. This must be the case 
because Xs will have frequently been 
aroused at the same time as a result 
of activity in A. As and Xs in the 
accompanying figures are, then, sim- 
ply different parts of the structure 
which are represented as having 
asymmetrical relations (connections 
in the direction from A to X). This 
means that activation of A will be 
followed by activation of Xs. 

Figure 2 presents a schematic 
picture of relations existing between 
S' and A’, between S” and A”, and 
between points on A and certain Ss, 
It will be noticed that a given A point 
is represented as being related to 
several Xs. X’, and X”. represent 
direct, maximum probability responses 
to the activation of A’ and A”, 
respectively. The connections of A’ 
to Xi, Xo, X3, Xs, Xe, and X7 and of 
A” to Xs, Xa Xs, X7, Xs, and Xs 
are structural relations whose activa- 
tion produces generalized reactions 
that have been developed in the 
following manner. Since distances 
within a continuum are assumed to be 
short, it must follow that when any 
point A’ has been activated in the 
past and followed by activation of 
some structure X’, other points (A,s) 
on A will also have been aroused at 
at the same time, the probability of 
this simultaneous arousal decreasing 
with increase in the distance of the 
given A point from A’. As a con- 
sequence, structural relations be- 
tween these A points and various Xs 
will have developed,’ their distance 
being an inverse function of the dis- 
tance A’-A,. Though separate con- 
nections from A to X are drawn for 


® See page 328 on restructuring and learning 
for the discussion of conditions of building a 


relation such as this. 
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convenience, the more accurate pic- 
ture would represent a sheaf or cone 
of relations without gaps in points on 
A that are connected with Xs5,10 

The nature of the activation surface 
at X will depend on the input to each 
X from all A sources, combined ac- 
cording to the assumptions of Prin- 
ciples II and III, The activation 
surface of Fig. 2, labelled combined 
probabilities, is obtained when the 
distances between the various points 
are assumed to be those indicated in 
the figure caption, when input to the 
two points on A are close enough 
together to provide multiple input to 
all Xs except those on the end of the 
continuum, and when the stated 
combining principles are applied to 
obtain the probabilities,u For ex- 


1” We have not taken account of possible 
input to Xs from points on A other than 4’ 
and A”. If this were done, the Probabilities 
of the intervening points would be even 
higher. 

" Obviously such a surface will change not 
only with change in distance A’ — A”, but 
also with different values of each distance in 


g probabilities of activation to each point 
are stimuli activating points A’ and A” on 
er points with the probabilities indicated 


ample, in Fig. 2 in the row marked 
PS -- > AN MG SX) the 
probability below X, (0.09) is the 
product of the probability that S’ will 
activate}A’ (0.9) and that A’ will 
activate X, (0.1). In the same row, 
the Probability under Xa (0.45) is 


the product of P(S'---> A’) and 
p(A' -> Xo) 


and the next one w 


multiple input at each X point. For 
example, X; has a Probability of 
arousal by S’ equal to 0.63 and by S”, 
0.09. The combination [1 — (0.37 
X 0.91)] is 0.66 (Principle III). 

nder these Conditions, it is clear 
that high Probability activation oc- 
curs at points between X’ and X”, 
and that the Probabilities fall off 
rapidly on either side of these maxima. 
gS 


the structure. Also it should be noted that, 
as the probabili 


yt ty values approach unity, less 
change will be produced by the existence of 
multiple input to any point. 


whys 


a 


PF 
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b(A’: Xn) 


09 .45 .63 .72 .63 .45 .09— x 


Xa Xs Xe Xr Xe Xo 


72 .63 45 .09 


—.08 40 .56 .64 .56 40 .08 


DS-A") p 
So 


pA" Xn) 


16 .67 .85 .94 .94 .91 .69 45 .09 Combined $ 


Fic. 3. Activation surface derived from combining probabilities of activation to each point 
on X when there is additional input to X from instructions (Sy) to judge S’. 


Furthermore, the points between X’ 
and X” are activated with higher 
relative frequency than any others on 
the surface. This assimilation sur- 
face involves, then, maximum prob- 
ability of activation at points inter- 
mediate between those positions that 
would be maximal if S’ or S” were 
presented and judged alone. 

But a symmetrical surface of this 
kind results only if there are no sets, 
instructions, contexts, or other factors 
that provide differential input at A’ 
and A”. Figure 3 suggests the effects 
of such differential input on the 
surface. Suppose the subject is told 
to judge the intensity of S’. It is 
suggested that such an instruction 
produces additional input to 4’, 
deriving from a motivation system 
set up by the instructions. This 
added input, identified as Sr- > A’ 
in Fig. 3, is assumed to be equal to 0.8 
and will have the effect of skewing the 
activation surface and of producing 
maximum probability of arousal at a 
point which is closer to X’ than in the 
case of Fig. 2. Instructions to judge 
S” would skew the surface in the 


opposite direction. 


If the same principles of combining 
input at various points, the same 
distances within the A dimension and 
between A points and X points, and 
the same instructions to judge S’ are 
assumed, we obtain the series of 
surfaces of Fig. 4 and 5 as the distance 


between the two points of input to A 
is increased. It will be seen that in 


+25) 


PROBABILITY OF ACTIVATION OF X 


o'tr23456786890N 2B 
ORDERED POSITIONS 
ONA 


Fic. 4. Graphic representation of activa- 
tion surfaces resulting when distances between 
points of activation are short. Surfaces A,B, 
C,and D. S’ and S” represent input points 
for the different surfaces. 
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PROBABILITY OF ACTIVATION OF X 


H 
012345678901 21314 
ORDERED POSITIONS 


ONA 


Fic. 5. Graphic representation of activa- 
tion surface resulting when distances between 


points of input are greater. Surfaces E, F, 
G, and H. 


passing from surface A to H, the 
breadth of the surface increases, bi- 
modality increases, and the values of 
the probabilities of activation at inter- 
vening points decreases. The relative 
number of events occurring between 
points of activation must therefore 
increase because of increased distance 
and because adjacent points will not 
so often be activated simultaneously 
(Principle IV), 
Change in the r 
distinguished events 
distance between two 
tion. Figure 6 repre: 
of the change in the relative number 
of events occurring between two input 
points with increase in the distance 
between the points, this dist 
inferred from the difference in the 
judged positions of the points when 
each stimulus is presented and judged 
alone. In Fig. 6 the average prob- 
ability of simultaneous activation of 
those adjacent pairs of points lying 
between input points is used as an 
index of the relative number of events 
(Principle IV). Operationally, the 
average probability that adjacent Xs 
will be activated is assumed to be 


elative number of 
as a function of 
points of activa- 
Sents an estimate 


ance being 
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99 


ler a5 6.7 © 
DISTANCE BETWEEN A' AND A’ 
Fic. 6. Relation between difference in 


judges position of S’ and S” when presented 
separately (distance 4’ — 4A” inferred) and 


equal to 
2 (papei) 


F Pest-borgs) (parap) 
N 


where p,, = Probability of activation 
of 2") Pay. = Probability that X 
adjacent to x7 is activated ; Pari 
= probability that the X adjacent to 
X” and lying between X’ and X” will 
be. activated; and N = number of 
adjacent Pairs between X’ and X”. 
his average Probability of simul- 
taneous activatio 
each surface 


p> 


' 
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distance between input points) and 
the point derived from surface H (no 
overlapping input to intermediate X 
points). When the solid line is above 
the dotted one, this means there is a 
smaller average probability of simul- 
taneous activation of adjacent points, 
and hence fewer intervening events 
for that surface than would be ex- 
pected if the number of events were a 
linear function of input distance. If 
the solid line falls below the dotted 
one, there are relatively more inter- 
vening events for the surface. 

It will be seen that the curve drops 
at an accelerated rate for surfaces 
B, C, and D, and that it falls at a 
decelerated rate for D, E, F, and G. 
This implies fewer events per A'A" 
unit of distance for the surfaces 
deriving from short distances between 
input points, and relatively more 
events per unit for the longer distance 
surfaces. Within the first group as- 
similation occurs; within the second, 
contrast, for as a result of this dis- 


12 Unfortunately, there is not space to 
discuss the relation of adaptation level theory 
to this formulation. In brief, adaptation 
level is conceived as that X point (Xar) 
which has the largest number of high prob- 
ability connections from points on continuum 
A. Xar will be the point or points directly 
activated by the As most frequently aroused 
in the past, because as a result of this fre- 
quency Xaz will have had the best chance of 
becoming connected with all A points. If A 
has been activated at a number of points, 
those X positions near the center of the series 
will have the highest probabilities of arousal 
from any point on A, because such central 
points are closer on the average to other 
Points on the continuum than are points at 
the end of the continuum. It is for this 
reason that the AL tends to lie at an average 
value of the experienced intensities of a 
stimulus. Given such a structure, the 
activation of any point A, will elicit not only 
its own X, with high probability but also Xaz 
through the generalized connections. In this 
Way an activation surface is set up, the 
Properties of which determine whether Xn and 

‘az are assimilated or contrasted in exactly 


proportionate increase in the number 
of events at the longer distances, the 
judged position of the two stimuli will 
be shifted toward the ends of the 
continuum to produce the contrast 
effect. 

The question inevitably arises how 
these activation surfaces come to 
produce a single response, such as the 
judgment of the position of S’, which 
in the contrast condition will be dis- 
placed toward the end of the con- 
tinuum. We must assume, without 
presenting the justification for the 
assumption, that this pattern of 
events (the activation surface) has 
become related to a structure repre- 
senting some such response as S” has 
low intensity of value E. Another 
activation pattern consisting of two 
maxima and fewer intervening events 
would be more closely related to a 
point of the structure whose activa- 
tion produces another judgment. But 
such responses will be complexly 
determined by input from points of 
maximum probability of activation 
on X combined with the process 
representing the number of interven- 
ing events. This suggests that con- 
trast-determined responses should 
have longer latency than those result- 
ing from assimilation. 


Algebraic Summation and Cancellation 
Effects ` 


It will be recalled that opposites 
structures involve continua which 
have developed subcategories as the 
distances along the continuum are in- 
creased. In considering the influence 
of such arrangements on the course of 
activation, we must ask first whether 


aS 
the same fashion that a surface produces 
assimilation or contrast when there are two 
external sources of input to A. The effect 
of this on the judgment of a single stimulus 
has been neglected for the sake of simplicity. 
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assimilation and contrast occur within 
ructures. . 
rie for example, the opposites 
structure underlying positive and 
negative evaluation or affect. Con- 
tinuum A is activated by input at two 
points. Suppose that. the Xs most 
readily elicited by activation on one 
side of the zero point of this con- 
tinuum represent unpleasant affect, 
and on the other side, pleasant affect. 
The mid-category of this series will 
refer to zero affect, which is a definite 
position or structure whose activation 
produces this response. Insofar as 
there is generalized input from points 
along the positive-negative continuum 
to intermediate X structures including 
the zero category, assimilation or con- 
trast will result for the same reasons 
and under the same conditions as they 
appeared within the simple similarity 
dimension. 

But the result will appear as 
algebraic summation of the two 
activated points. Thus, if one activa- 
tion point lies near the extreme nega- 
tive end of the continuum and the 
other point activated at the same time 
is near zero but on the Positive side, 
the intermediate point of maximum 
activation should be negative. Con- 
versely, with high positive and mod- 
erately negative points activated, the 
result will be slightly positive, But if 
the activation points are equidistant 
from zero, then the zero category will 
be the one most probably activated, 
In other words, the Phenomenon of 
algebraic summation of Positive by 
negative affect is a special case of the 
more general theory of assimilation, 

and depends on (a) the existence of 
ordered arrangements of positions ona 
continuum, (6) input from the two 
points of activation to intermediate 
points, and (c) the existence of sub- 
categories within the dimension such 


that different responses are produced 
if the point of maximum activation is 
on one or the other side of zero. 

There is some evidence that such 
summation of positive and negative 
affect actually occurs. For example, 
Rosenberg’s (32) successful prediction 
of attitudes toward segregation from 
the combination of positive and nega- 
tive consequences related to segrega- 
tion implies such summation. And 
Osgood and Tannenbaum’s evidence 
for their congruity theory of attitude 
change also implies this same pooling 
of affect organized around a concept. 
Although they emphasize the assimi- 
lative aspect of the process, they do 
recognize that assimilation may not 
take place, when they say: 


The amount of incredulity produced when 
one object of judgment is associated with 
another by an assertion is a Positively accel- 
erated function of the amount of incongruity 


which exists and operates to decrease attitude 
change, complete! 


y eliminating change when 
maximal (25, p. 47) 


In other words, “incredulity” tends to 
appear when the affect attached to a 
statement attributed to some source 
is distant from the level of affect (or 
evaluation) attached to the object 
about which the statement is alleged 


to have been made, so there is no 
assimilation, 


Presumabl 
within an o 
the distance 


y contrast may also occur 
PPosites structure when 
e is greater, provided the 
points activated are not at the ends 
of the continuum, Contrast is obvi- 
ously impossible in this latter case, 
ecause activation cannot be shifted 
° more extreme points on the con- 
tinuum even though the activation is 
of the sort that usually leads to con- 
trast. The extreme response is given 
when the stimuli are presented alone; 


it cannot be any More extreme when 
the stimuli occur together. 


| 
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Balanced and Unbalanced Structures 


Heider (11), Newcomb (23) and 
others have used the language of 
Lewinian field theory in discussing 
“strains toward balance and sym- 
metry.” More recently, Cartwright 
and Harary (7) have described the 
same situations in the more general 
terms of graph theory. It is our hope 
that the same material may be 
handled ultimately by the concepts 
formulated in this paper. 

Let us describe a structure which 
appears to have the properties of 
balance. A friend (F) has in the past 
often elicited in an experiencing in- 
dividual strong positive feelings and 
on rare occasions negative feelings. 
Assume that these experiences have 
resulted in building the structural 
arrangements of Fig. 7. F is close to 
P (strong positive affect, p = 0.9); 
F is far from N (negative affect, 
p = 0.1). Positive and negative af- 
fect lie on an opposites dimension, the 
ends of which are represented as being 
far apart (p = 0.1). The relations 
are symmetrical. 

On some occasion the friend is 
present as a stimulus which is assumed 
to provide cues for the direct activa- 
tion of structures F, N, and P, the 
probability that S will activate each 
of these structures being equal to 0.9. 
What will the outcome be? 

On the cycle following input to these 
points, F and P will each be activated 
with a probability of 0.83, while the 
probability of N will be 0.17. These 
values are derived by applying Prin- 
ciples II and III to the situation. 
For example, the probability of the 
activation of P from the direction of 
F is the product of the probability 
that F will be activated by Sf (0.9) 
and that P will be activated when F 
is active (0.9). This is equal to 0.81. 
The probability that P will be acti- 
vated from the direction of N is the 


product of 0.1 and 0.9, and is equal to 
0.09. Combining this multiple input 
(0.81 and 0.09) yields a probability 
of 0.83. In similar fashion, the 
probability of activation of N and F 
may be calculated for this cycle. 

The obvious fact here is that F and 
P are activated with higher prob- 
abilities than M, and therefore will 
more often be aroused at the same 
time (p of simultaneous arousal 
= pr X pr). As a consequence, F 
and P (the friend and positive affect) 
tend to be part of the same whole or 
the same unit event from which NV is 
excluded, because of its distance from 
the other two structures and the lower 
probability of its being activated at 
the same time as they (Principle IV). 

This structure is balanced, then, in 
the sense that (a) its activation will 
tend to result in two events or wholes 
with relatively clear-cut boundaries, 
F-P on the one hand and N on the 
other; and (b) the activation. should 
be of short duration because the isola- 
tion of the F-P segment from N 
makes this essentially a simple two- 
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Fic. 7. A “balanced” structure. The ps 
are probabilities of activation of one point as 
a result of activation of the point connected 
with it by a line. Arrows indicate direction 
of the relation which is in each case sym- 
metrical. Ss are points of stimulus input. 
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Fic. 8. An unbalanced structure. 


part structure consisting of the F-P 
Segment and the N segment. The 
mutual activation of F by P and P by 
F would be expected to produce rapid 
activity decrement because of the sim- 
plicity of the pattern and the short 
distances within this F-P segment. 
Activity within N should be of even 
shorter duration (Principle V). 

Figure 8 represents certain aspects 
of a structure assumed to underlie 


0.83. 
Given these values, the Probabilities 


are high that these three points will be 
activated simultaneously and that 
they will be part of the same whole: 
b(Fand P) = 0.8; b(N and F) = 0.8, 
and p(P and N) = 0.69, However, 
despite the fact that the Probability 
of the simultaneous occurrence of P 
and WN is relatively high, they are 
distant from each other and so will 
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not so readily become a unit event 
(Principle IV). The surfaces result- 
ing from the activation of either F and 
N or F and P would be assimilative in 
pattern. However, any pattern 
which includes F, N, and P will in- 
evitably lead to diffuse activation, 
since negative (N) and positive (P) 
affect will typically be associated with 
different structures, Moreover, since 
in this case, unlike the balanced 


As a result, it 
that activity 
within this structure would tend to be 


distances are 


terms were people, related by liking, 
disliking, similarity, Causality, and so 


he would regard 
as balanced a structure in which A 


while B likes C. 


Here, as 


BI tis interesting to note in this connection 
the common root in the phrases to like, to be 
alike, and to be likely. This suggests a 
common Meaning for liking (positive atti- 
tude), being alike (similar or short distance 
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before, the structure is balanced in the 
sense that B and C act as a unit with 
consequent assimilation, while A is 
isolated. In Heider’s unbalanced ar- 
rangement, A likes B and C but C 
and B dislike each other. Again, 
one event, A, is close to two others, 
B and C, which are far apart, and this 
pattern will have the properties of the 
schema of ambivalence in Fig. 8. 

These examples suggest that the 
basic characteristic of the arrange- 
ments which have been called bal- 
anced is the fact that they provide the 
conditions for the development of 
focused and circumscribed activation 
surfaces which result in high prob- 
ability activation of definite reactions 
and prompt completion of activity 
within the particular structure. On 
the other hand, the so-called un- 
balanced structure leads to a more 
diffuse activation surface and longer 
Continued activation within the 
Structure.“ 

It is of interest that Heider’s un- 
balanced structure has been shown by 
Kogan and Tagiuri to be relatively 
Tare (18). Thus configurations in 
which a person we like is conceived to 
Choose one we dislike, or in which a 
Person we dislike chooses one we like, 
are both less frequent than would be 
expected by chance. This should 
follow from the fact that in the un- 
balanced structure events such as 
these all tend to be activated simul- 
taneously, frequently, and repeatedly 
with the consequence that the dis- 
ss 


part), and being likely (high probability of 
activation of one by the other). 
“There are other interesting properties of 
these structures that cannot be discussed now. 
Or example, a greater average difference in 
th € distances between the same two points via 
ifferent paths may be an important char- 
acteristic of the unbalanced structure; that is, 
€ distance from A to B may be much greater 
pr less than distance A to C to B in un- 
alanced structures, but more nearly equal in 
alanced structures. 


tances between parts will tend to 
become shorter, other things being 
equal. In other words, this un- 
balanced system of relations provides 
certain important conditions for re- 
structuring and learning which leads 
to a new structure in which the parts 
are more closely related and hence 
tend to form a whole. 


Motivation 


It should be noted that those prop- 
erties which lead to imbalance as de- 
fined above are the essential prop- 
erties of a motive structure. The 
author has stated in an earlier paper 
(27) that when there is a certain 
degree of disparity between two events 
which are related or similar in some 
way, activation tends to be main- 
tained in the system and thus to 
provide the continuing activity which 
is one characteristic of the motivated 
state. This activity continues until 
some change in the situation reduces 
the disparity or makes it so great that 
activity ceases. Thus disparity be- 
tween the perception of one’s status 
and that of his neighbor may con- 
stitute such a system, provided (a) 
that the two events, own status and 
that of the neighbor, are close enough 
together to produce a relatively high 
probability of activation of one term 
of the structure by activity in the 
other; and (b) that the disparity (or 
distance) and/or the number of parts 
are great enough to prevent activity 
decrement. Stated in terms of the 
present discussion, this amounts to 
saying that a motivated state exists 
when there is distance between events 
(disparity) great enough to prevent 
their being reacted to as a single 
whole with consequent activity decre- 
ment, and when at the same time the 
events are close enough together or 
related in such a way that disparate 
terms of the structure are mutually 
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ivati on 
activating, perhaps by a comm 
associate such as F in the unbalanced 
structure of Fig. 8. 


SUMMARY 


This paper has proposed that the 
course of events within psychological 
space is determined on any occasion 
by the nature of the structure, i.e., 
by the position of parts and the 
distance between them; and by the 
points in the structure which receive 
activation from some source at the 
moment in question. 

Structures involving relations of 
similarity, Opposition, complements, 
antecedents, and consequents are 
described in terms of the position of 
parts and the distance between them. 
The course of activation within cer- 
tain of these structures is Predicted 
on the basis of assumptions about the 
following: (a) the relation of distance 
to the Probability of activation; 

(b) the combination of Probabilities 
when there is multiple input to the 


about the 


Principles may 
account for phenomena such as con- 


text effects, assimilation and contrast, 
algebraic summation of Opposites, 
“balanced” and “unbalanced” func- 


tions, and the Persistence of activity 
in the motivated state. 
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BASIC ISSUES IN PERCEPTUAL THEORY 


W. M. O’NEIL 


University o, 


An attempt to classify modern the- 
ories of perception will be used to bring 
out some major points at issue between 
them. Some of these issues are epistemo- 
logical but, as the Psychologists’ at- 
tempts to shelve such questions as non- 
Psychological have proved unavailing, 
it is better to face them without too 
much apology. 

Five bases of classification are pro- 
posed. yielding 
cells, as it were, in a five-dimensional 
solid. Though the five bases are logi- 
an historic 
tendency for the Position taken on one 
aken on an- 
cells are re- 
This fact en- 


ether perceiving 
r something done 


course of the or- 
ganism’s coping with its surroundings, 
take an activist position, Whereas those 
who think of it in terms of the reception 
of stimulation, or as the input of in- 
formation, are passivist. It is improba- 
ble that any contemporary theorist is as 
thoroughgoing a passivist as was, say, 
James Mill (35). With the common 
agreement that perception is selective, 
there is an admission that the organism 
is at work on the available stimuli or 
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information. Nevertheless, there is a 
strong passivist tendency amongst the 
more behavioristically minded of the 
contemporary theorists. After Titchener 
there has not been any strong tendency 
of this sort amongst the phenomenalists. 
Within the activist theories there are 
some important differences. Primary 
among them is the treatment of percep- 
tion as reactive or as proactive, to use 


Murray’s terms (37). Those who re- 
gard perceptio 


real or as a phe- 
The issue, an essen- 
cal one, is whether we 


nomenal object, 
tially epistemologi 
perceive a real or 
ject or a represe 


: “Sensation 
the sensorium receiving 
through the medium of the nerves, and 


as the result of the action of an external 
cause, a knowledge of certain qualities 


‘ 


wadi a. iii 
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or conditions, not of external bodies, but 
of the nerves of sense themselves” (36, 
p. 1065) and “The immediate objects of 
the perception of our senses are merely 
particular states induced in the nerves, 
and felt as sensations” (36, p. 1073). 

Early experimental psychology in- 
herited a clear phenomenalist tradition 
from both sides of its family tree. In- 
deed, the analytic phenomenalism of 
British associationism blended quite 
readily with the analytic phenomenalism 
of the sense-physiologists such as Bell, 
Miiller, and Helmholtz. The rise of 
realism in British philosophy in the 
early years of this century gave some 
special twists to the tradition, but it was 
American realism, especially working 
through behaviorism, which rejected it. 
However, the behaviorists largely va- 
cated the field of perception. Their 
prime interest was in effector-based re- 
sponses, and any interest they had in 
sensory processes was limited in the 
main to questions of sensitivity. There 
was an early promise, made first by the 
Philosopher Perry (39), to give a full 
treatment of perception in terms of dis- 
criminative response, but only recently 
have serious efforts at fulfilment been 
evident. Thus it has been left largely 
to the phenomenalists to cultivate the 
greater part of this field. 

The problem of error in perception or, 
put the other way, of veridical percep- 
tion is a genuine one for psychology as 
well as for epistemology. All too often 
the psychologist with phenomenalist 
Convictions regards it as not his concern 
to make out how the organism attains 
knowledge of the external world; this 
includes, of course, the scientist attain- 
ing knowledge about his subject matter. 
As a psychologist he is ready enough to 
confine the organism to an acquaintance 
With phenomena, yet as an organism 
theorizing about perception he presumes 
an acquaintance with both appearance 


and reality. 
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Two phenomenalists as different as 
Titchener and Koffka used the fact of 
error in perception to distinguish the 
arena of psychology from that of physi- 
cal science. Titchener (45) analyzed 
visual and other illusions in order to il- 
lustrate his distinction between experi- 
ence viewed as independent of and as 
dependent upon an experiencer. Koffka 
(27) made his distinction between the 
geographical and the behavioral environ- 
ment the more convincing by his de- 
scription of the dog bounding into a 
snow-filled crevasse which he took to be 
solid snow-covered earth. In general 
the phenomenalists make out a persua- 
sive case that we perceive not real things 
but appearances, and that we respond 
to things not as they are but as they 
appear to be. These propositions, how- 
ever, make it impossible to deal with or 
to recognize the fact of error which had 
so large a part in leading to them. If it 
were the case that we know only phe- 
nomena, there would be no way for us 
to get to know about error. Only if we 
truly know the real on some occasions 
can we tell that on others we have fallen 
into error. Within the realist position 
it is not difficult to show how the fact 
of error in perception can be recognised, 
though precisely which piece of knowl- 
edge is fallacious may be quite difficult 
to decide. Were realism to make out 
that such a decision was easy, it would 
be embarrassed by the fact that scien- 
tific progress is so difficult. In accord- 
ance with the formula about the whole 
truth and nothing but the truth, we may 
distinguish errors of omission and errors 
of commission. The former is quite 
easy for the realist to handle. For in- 
stance, in terms of perception as a dis- 
criminative response, it may be said that 
the organism has failed to respond to 
the object or that it has failed to re- 
spond differently to different objects. 
The error of commission is illustrated 
by the situation where we seem to have 
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j f such and such properties 
aoe a yet where other, especially 
later, experience convinces us that there 
was no such thing confronting us. The 
realist, content with the discriminative 
response view as it is usually formulated, 
has no answer to the phenomenalist’s 
question: “Where’and what was the ap- 
parent object?” 

The perceived as a term or as a prop- 
osition. The distinction being made is a 
logical and not a verbal one. A term is 
any property, such as what the word 
“red” means, or any set of properties, 
such as what the words “the red piece 
of paper” mean. A proposition is the 
predication of one term of another. The 
words “the red piece of paper is on a 
gray background” express or symbolize 
a proposition. In the behaviorist the- 

ories, as well as in most phenomenalist 

theories of perception, the perceived ob- 
ject is treated as a term. Thus the 
organism discriminates the red paper 
from its gray background or the per- 
ceiver experiences the field as a red 
paper figure on a gray ground. But if 
we consider perception as attaining 

knowledge, at any rate what James (26) 

meant by knowledge about, we need to 

think of what is known as having propo- 
sitional form. Facts are not in the 
ordinary sense things or terms; they are 
not what can be expressed by means 
of nouns, adjectives, verbs, or adverbs 
alone. Facts are what we need sen- 
tences to talk about, and knowledge of 
a fact is judgmental. Those adopting 
the propositional view have been in 
recent times few in number and almost 
always no more than implicit adherents, 
There being so many who have treated 
the perceived object as a term, a much 
greater diversity amongst them can be 
expected. 

Three major conceptions of the per- 
cept can be distinguished amongst the 
phenomenalists. At one end are to be 
placed those whom Boring (7) calls the 
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Leipsigers, and at the other are his 
Berliners. The Grazers, kept company 
by Spearman and a large band of func- 
tionalists, are located somewhere in be- 
tween. The Leipsigers, especially in the 
purified form of Titchener, agree with 
the radical behaviorists in being satisfied 
with simple discrete substantial terms. 
Titchener’s sensations, despite his talk 
of context, are just as simple and dis- 
crete as Watson’s stimuli, despite his 
talk, of sitytior -="Titchener had no 
D. i 3 : BER 
way of including re.ationships within the 
experienced, and a behaviorist who con- 
ceives stimuli either as energy sources or 
as energies is equally handicapped . The 
earnest attempts of the Hullians to ex- 
plain transposition phenomena in terms 
of stimulus generalization show the dis- 
taste for the inclusion of relationships 
among the stimuli. The Grazers found 
a place in the percept for relations, but 
did so by regarding them as imported 
into or imposed upon the situation by 
the perceiver. Both they and the Ber- 
liners agree in admitting relations as 
well as qualities in the percept. But 
the two disagree upon how relations get 
there. For the Berliners they are not 
the result of some “production process,” 
but are as primary as the qualities re- 
lated. 

There was a distinct inclination 
amongst the Ak¢ psychologists to adopt 
a propositional view. Brentano’s dis- 
tinction between the “lower” act of 
ideation and the “higher” act of judg- 
ment included a recognition that the 
object of the latter was propositional. 
Stout was quite explicit when he said: 


In general, it would seem that a complete ob- 
ject [of Consciousness] can be adequately de- 
scribed in language not by isolated words but 
by propositions capable of being asserted, 
denied, doubted or assumed. . . . When we are 
said to perceive or know something this may 
be taken to express the fact that we perceive 


or know that something is the case (44 
p. 100). a 


— 
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More recent formulations have not been 
as explicit. But the recent analyses by 
MacCorquodale and Meehl (31, 32) and 
Meehl and MacCorquodale (33) reveal 
that expectancy in Tolman’s theorizing 
is propositional in form. Insofar as 
expectancy is equivalent to hypothesis, 
as used by Tolman and Brunswik (47), 
and as later adopted by Bruner (9) and 
Postman (40), all four theorists may be 
claimed to adopt a:propositional view. 
Helson (23), in stè thir :daptation- 
level theory as a general theory of per- 
ception, gives some signs of subscribing 
to this view too. 

Descriptive versus abstractive modes 
of analysis. In offering a characteriza- 
tion of something in a way which dis- 
tinguishes it from other things, we may 
resort either to descriptive or to ab- 
Stractive terms. The former are fea- 
tures of the object actually before us 
when we inspect it, features that stand 
up for inspection, whereas the latter are 
not literally present in the form in which 
Our analysis might suggest they are. 
The Wundtians recognized most of the 
time that the elementary sensations with 
Which they characterized a given per- 
Cept were in fact abstractions. A pure 
Sensation, as they conceived it, is not 
Something that we ever experience. 
Likewise, the dimensions of experience 
which have been derived by Boring (6) 
from the later Titchenerian doctrine of 
Sensory attributes (46) are not com- 
Ponents of experience, but the dimen- 
Slons of a reference frame within which 
Some particular experience may be lo- 
Cated. On the other hand, features like 

| figure-and-ground, contour, and gradient 
| 9f texture are deemed by those referring 
to them to be the very material or 
Structure of our experience as we have 
Jt. They are descriptive terms. Wundt, 
je early Titchener, and the Leipsigers 
8enerally preferred elemental abstrac- 
tive analysis; the later Titchener, fol- 
lowed by Boring and Stevens and their 
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associates, have adopted a dimensional 
abstractive analysis; David Katz, Rubin, 
the Berliners, Gibson and the phenome- 
nologists generally have employed a de- 
scriptive and usually structural analysis. 

Preferred location of causal condi- 
tions. In seeking to give a causal ac- 
count of perception, the early theorists 
stressed proximal stimulus conditions, 
Later there was a considerable swing to 
distal stimulus conditions, as Brunswik 
(12, 14) pointed out. But both place 
the emphasis upon what Bruner and 
Postman (11) call autochthonous fac- 
tors, whereas these two, in concert with 
many others in the last decade or two, 
have emphasized needs and other re- 
lated “central” factors. There is, of 
course, another kind of “central” factor 
represented by the brain fields of 
Kohler and Wallach (28) and of Krech 
(29, 30), and by the cell assemblies of 
Hebb (22). The classification of per- 
ceptual theories contained in Allport’s 
treatise (1) is based primarily upon the 
determinants stressed in the theory. 

It is appropriate at this point to com- 
ment upon the unfortunate tendency of 
psychologists to identify causes with 
their effects, or to assimilate them. 
Werner and Wapner (48), in arguing 
that if both sensory and motor (tonic) 
processes interact they must be of the 
one kind, provide an example of as- 
similation. The widespread resort to 
dispositional concepts such as attitudes 
and sets usually involves an identifica- 
tion. No more is said about them than 
that they have specified observed effects 
and that they differ from one another 
in the effects they each have. Opera- 
tional or dispositional statements are 
well enough, but the trouble arises when 
the need for anything more is not felt. 
It is well enough to identify opiates by 
means of their soporific effects, but it is 
misleading to identify them with those 
effects. An interesting example of this 
sort of identification may be found in 
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the instructive controversy between Bor- 
ing (7, 8) and Gibson (17). Boring, 
supposing for argument’s sake our planet 
to be more liberally endowed with satel- 
lites, asks what we would be perceiving 
when we perceived the size of a moon. 
He is not a naive realist, and so he is 
not satisfied by the reply “Why, the 
size of the moon we happen to be look- 
ing at!” His own studies of the ap- 
parent size of our one and only pre- 
Sputnik moon led him to conclude that 
major determinants of such a perception 
are the actual distance, size, and eleva- 
tion of the moon, the perceived dis- 
tance of the moon, the actual elevation 
of the perceiver’s regard, and the per- 
ceiver’s attitude. He goes on, quite 
reasonably, to suppose that these values 
may be used as the parameters of some 
function, ø, which would be invariant 
when the perceived size of the moon is 
invariant. So far, to adopt a Boring 
phrase, this is good science. But it is 
Probably only a confusion of cause and 
effect when he says“... then, in judging 
size, you are Perceiving not object size, 
not retinal size, but ø. To discover the 
object of Perception, you have to dis- 
cover what function of the stimulus js 
invariant when the Perception is in- 
variant” (7, p. 146). 


A THREEFOLD CLASSIFICATION 
The historical associ 
these five basic iss; 
that a workable 
tion theories, 
and judgmenta 
one of these js 


ation of views on 


the rather tough- 
orists standing 
the more tender. 
gists. In the s 
these types of theory will be character- 
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ized in general terms. Attention will be 
paid to the relative strengths and weak- 
nesses of the discrimination and the phe- 
nomenalist theories. Finally, as part of 
an attempt to bring out what has usually 
been left implicit in judgmental theo- 
ries, it will be argued that this type of 
theory affords a hope of our being able 


to retain the best of the other two 
worlds. 


Discrimination Theories 


Discrimination theorists are ordinarily 
slightly on the activist side of center. 
They are ordinarily realists in that the 
stimulus (wherever it is to be located, 


distally or proximally, and however it is 
to be analy. 
and not 


e of the response. 
disposes of de- 
and attitudes by 
in experimental 
timuli. Boring, 
t thought of as 
tougher-minded phe- 
nomenalism towards discrimination the- 
ory, speaks of the Perceiver’s attitude as 


one of the stimulus Properties which is 
a Parameter of ø. 


Apart from a 
specific 


discrimination the- 
hose with the greater physio- 
terests tend to be more proxi- 
mally focused, and to look for percep- 
tual determinants largely in the sensory j 
mechanism. ‘Those who have been most 
influenced by modern positivism tend 
to have the more sophisticated view 
about the Stimulus, to have a greater 
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“focal length,” and to favor dimensional 
analysis through their concern with 
stimulus continua. Those who are led 
to their problems by practical considera- 
tions—such as the need to establish the 
perceptibility of signs, signals, displays, 
and the like—tend to be empiricist in 
the rule-of-thumb sense. 

This type of theory has many attrac- 
tions. Foremost amongst these is the 
scope provided for rigorous theorizing 
and rigorous experimentation. Another, 
of less certain value, is the avoidance of 
the bothersome problem of introspec- 
tion. As the aim of a perceptual experi- 
ment within this theoretical context is 
the testing of limits of discrimination 
and the seeking of invariant relations 
between stimuli and discriminative re- 
sponses, it does not matter what the 
responses be, whether verbalization, key- 
Pressing, or making matches; and it 
does not matter how they be established, 
whether by conditioning or by instruc- 
tion stimuli. When a verbal response is 
used, it does not have to be treated as 
a report. To treat it as a report the in- 
Vvestigator must concern himself with 
what it alleges and not with its own 
distinctive features as an event. Such a 
Concern raises the nagging doubts about 
intersubjective corroboration to which 
the discrimination theorist seems prone. 
Their avoidance in this way does not, 
however, preclude the discrimination 
theorist from dealing with many prob- 
lems that have often been deemed to be 
Subjective. For instance, it can be 
argued that, in establishing a subjective 
or phenomenal scale of brightness or 
loudness, the subject is being calibrated 
as a different kind of instrument for ap- 
plication to some physical variable (vide 
Bergmann and Spence, 3). 

The discrimination theorist also has 
the advantage of the phenomenalist in 
that he can offer some coherent account 
Of error, as Perry (39) and other Ameri- 


can realists have demonstrated. How- 
ever, saying that when in error the 
organism has made the “wrong” re- 
sponse does not go far enough. It needs 
to be shown in what way that response 
is wrong. When the outcome of an 
animal’s having made some response is 
the survival or the death of that animal, 
or even when the outcome is a shorter 
or longer period of delay in obtaining 
food or a mate, it may be said that the 
response made was right or wrong. 
However, there are occasions when an 
illusion contributes to survival or to a 
maximization of benefits, but is none- 
theless an illusion. What distinguishes 
it as an illusion is that the seeming 
knowledge in it is false. This same 
limitation obtains when this type of the- 
orist attempts an account of knowledge 
in the sense of our store of knowledge. 
Apart from the limitation of the disposi- 
tional terms employed, for instance, by 
Hull (25) and later Berlyne (5), no 
way is provided for a distinction be- 
tween true knowledge and false knowl- 
edge. 

The advantages gained by the es- 
chewal of introspection must have offset 
against them some handicaps accruing 
at the same time. It would appear help- 
ful to know that chromatic colors dis- 
appear in scotopia, and that it is not 
merely a matter of some one chromatic 
color replacing all other colors. A 
purely discriminative experiment could 
tell us no more than that, apart from 
any intensity differences, stimuli of one 
wave length are not discriminable from 
those of another including a mixture of 
all wave lengths. We could have no 
indication whether the central processes 
following different stimuli were the same 
or whether these processes were func- 
tionally equivalent though qualitatively 
different. The pattern of verbal re- 
sponses made with the addition of ap- 
propriate instruction stimuli would not 
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afford a clue to this unless we regard 
such responses as alleging what is the 
case, i.e., as being introspective reports. 
Finally, a discrimination theory usu- 
ally involves a renunciation of what 
seems to some the genuine and impor- 
tant psychological question expressed 
by Koffka (27) as: “Why do things look 
as they do?” This question seems pos- 
sible only if we are willing to consider 
introspective data and make some form 
of distinction between the real and 
the phenomenal. A distinction between 
tape-measured and subject-measured dis- 
tance, for instance, does not go far 
enough. But as soon as the discrimina- 
tion theorist begins to make the neces- 
sary admissions he is in danger of losing 
many of the advantages he had other- 
wise gained for himself. Berlyne sug- 
gests: “If we think of the perceptions 
which constitute the ‘phenomenal world,’ 
etc., as reactions to stimulation from 


physical energy-changes, and Overt be- 
havior in its turn as a reaction to them, 
we may be abl 


e to enjoy some of the 
benefits of both Points of view” (4, p. 
145 However, if Perception is 
as a response in th i 


a stimulus-producing 
al or inter- 


Phenomenalist Theories 
Phenomenalists ha 
over the active. 


temporary Tep 
than not com 


ve a broader spread 
-Passive variable, Con- 
Tesentatives more often 


down on the activist 
side, sometimes well out towards the ex- 


treme. They are never Tealists in pre- 
dominant flavor, although many struggle, 
as it were, towards realism. Generally 
speaking, the more distally focused have 
the stronger realist tendency. In the 


past they preferred elemental analysis, 
but now the predominant preference is 
for descriptive structural analysis. Once 
the basic classifying and counting had 
been done by the elementarists, the re- 
maining part of the program appeared 
tedious and sterile. Phenomenology has 
afforded a much richer alternative. . In 
both varieties, phenomena are conceived 
as terms—sensations when the focus 1s 
Proximal, and objects when it is distal. 
The favored determinants are stimulus 
Properties (though played down as com- 
pared with the discrimination theorists), 
other phenomena, brain states, and, with 
the more markedly activist, motives. 
There is some dispute, however, amongst 
them as to the legitimacy or profit in 
using one or other of these. For in- 
stance, Gibson ( 15) seems mainly con- 
cerned to find stimulus correlates and 
little impressed with the need to locate 
neural determinants, whereas Werner 
and Wapner (48) center their attention 
upon the total organismic state. The 
rocks that have seriously split phenome- 
nalism have been the issue of abstractive 
versus descriptive analysis and the issue 
of proximal versus distal focus. Though 
independent issues, those choosing ab- 
stractive analysis have usually chosen 
a proximal focus, and those choosing 


descriptive analysis have usually chosen 
a distal focus, 


Though an epistemological history of 


phenomenalism should go back at least 
as far as Locke’s distinction between 
primary and secondary qualities, a psy- 
chological history can appropriately be- 
gin with that part of Müller’s doctrine 
of the specific energies of the sensory 
nerves which stated that we are aware 


not of external objects, but of the states 
they produce in our sensory nerves. 
Withi 


n this doctrine a veridical knowl- 
edge of external objects would be at- 
tained to the extent that there is some 
Parallelism or isomorphism between the 


external objects and the states they 
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produce. When that parallelism breaks 
down we have error or illusory percep- 
tion. But how anything about the par- 
allelism or its absence can be discovered 
is a puzzle. It is not enough to say, as 
Miiller did, that vision is ordinarily 
elicited by light, and that when it is 
elicited by pressure on the eyeball there 
is an incongruity of stimulus and sensa- 
tion. To say this assumes that we may 
know light and pressure on the eyeball 
as other than phenomena. But even 
were there two sorts of knowledge—the 
sort we have in experiencing color and 
the sort we have in knowing that light 
is impinging on the eye—it is difficult 
to show how we would relate one to the 
other and to say in which realm of 
knowledge this relating occurred (vide 
Anderson, 2). 

There is an interesting paradox in the 
Phenomenalism of Miiller’s sort. Our 
knowledge of our sensory states must in 
one sense be veridical, i.e., in terms of 
the doctrine, we know only these states 
and can not know other than what is 
Present in them. Yet it almost always 
seems to be knowledge of external ob- 
jects, which, according to the doctrine, 
is a nonveridical impression. Further, 
what is said to be knowledge of the 
states of our sensory nerves gives us the 
knowledge we have of external objects 
and very little, if any, information about 
the states of our sensory nerves. The 
way we are able to get such knowledge 
is to have those states as the external 
objects of other sensory states. Müller, 
in effect, told us that it is not the moon 
that we see but a state of affairs in the 
visual mechanism, whereas Boring tells 
us that it is not the moon that we see 
but a mathematical function. Both 
seem to be confusing what we see with 
the conditions of our seeing. ` 

Though the later phenomenologists 
reject sensory states located at the or- 
ganismic surface, their objects are not 
literally out there. Their focus, as 
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Brunswik (14) shows, is not fully distal. 
The term “intermediate object,” which 
Brunswik used earlier in his work on 
object constancy, might well be applied 
with a slightly different meaning to the 
Phenomenal object in these theories. 
There remains, nevertheless, the impass- 
able gap between the phenomenal and 
the real. However, the phenomenologi- 
cal approach has the virtue of producing 
an orderly account of how things look. 
The now classical work of Katz, Rubin, 
and Wertheimer and the more recent 
work of Michotte (34) and Gibson (16) 
is evidence enough of this. Further- 
more, such work has produced data 
which seem amenable to corroboration 
and which set what has already been 
said to be a genuine psychological prob- 
lem. 


Judgmental Theories 


Perhaps some form of discrimination 
theory has greatest appeal to the physio- 
logically minded, and some form of 
phenomenalism to the psychologically 
minded, especially when in the ranks of 
the professional physiologists. Judg- 
mental theories have certainly been pro- 
pounded most often by the philosophi- 
cally minded. However, if, as will be 
asserted here, such a theory can ac- 
commodate some distinction of the sort 
attempted in the contrast between the 
real and the phenomenal, and at the 
same time preserve some of the ad- 
vantages of objective reference attained 
by discrimination theories, it may well 
be the type of theory the psychologist 
will find best adapted to his needs. 

Unfortunately, we have no contem- 
porary judgmental theory in sufficiently 
elaborated form to let us prove these 
hopes. Bruner (9) and Postman (40) 
gave separate statements of their Joint 
thinking which constitutes one promis- 
ing outline. However, much that is 
relevant to the claims being made is left 
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implicit in their theorizing. An ex- 
ein is rendered the more difficult 
by a more recent paper by Postman 
(41) which reveals a distinct swing 
away towards a discrimination theory, 
A later paper by Bruner (10) extends 
some aspects of the judgmental treat- 
ment but does not relate its terminology 
to that of the earlier statements. With- 
out asking these theorists to accept any 
responsibility for what is to be main- 
tained, many things which perhaps they 
thought best to leave implicit will be 
made explicit. Then, in the light of 
that interpretation, some indication will 
be given of where their theorizing seems 
to have gone off (or not far enough 
along) the track favored here, Finally, 
some promissory notes will be proffered 
in the absence of hard cash to capitalize 
the expansion of this type of theory. 
Bruner, Postman, and their associates 
have already a great deal of hard cash 
to the credit of other aspects of their 
theorizing. 
The full course of Cognition was said 
in 1951 to be an hypothesis- 
tion-testing cycle. 
plied not only to p 
thinking such as remembering, imagin- 
ing, and reasoning, This is an im- 
mediate attraction. All too often we are 
asked to conceive perception in one way 
and thinking, with all it entails, in so 
unrelated a way that we remain puzzled 
as to how what we acquire through per- 
ception can be incorporated 
thought. 


The central notion is “hypothesis,” 
which in Bruner’s later statement seems 
to be replaced by “category,” Hypoth- 
eses “serve to select, organize and trans- 
form the stimulus information” (40, p. 
249) ; or “stimulus inputs may be sorted, 
given identity, and given more elabo- 
rated, connotative meaning” in terms of 


categories (10, p. 148). Hypotheses 
are deemed to vary in strength, or cate- 


informa- 
This conception ap- 


erception but also to 


in our 
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gories to vary in accessibility. But 
what hypotheses or categories are is a 
matter left a little obscure. Bruner in 
general was and is content with a dis- 
positional conception—these are we- 
know-not-whats recognizable only by 
their determinants or by their effects. 
Postman went a little further in saying 
that an hypothesis is an expectation 
that such and such is so and so, or that 
such and such does so and so. He re- 
lates this to Tolman’s notion of ex- 
pectancy. MacCorquodale and Meehl 
have argued convincingly that an ex- 
Pectancy in Tolmanian theory has the 
form “if (S, and R), then Se”; eg., “if, 
when the light flashes, the lever is 
pressed, then a pellet of food falls into 
the cup.” This, of course, is an action 
hypothesis which is the sort required in 
a theory of performance. Where the ac- 
count puts the emphasis upon cognition, 
the “response component” may be ig- 
nored; thus “if S1, then Se.” Cognition 
of whatever type is to be regarded then 
as a predicating (asserting, denying, 


supposing, doubting and so on) Sə of 
Si; or, if no imputation of conscious 
deliberatio 


m is made, as a judging of S, 
It is important to stress that 
Sı and Sa may be quite complex, and 
that a distinction between a judgment 


and the verbal or other expression of it 
is intended. 


The judgment or predication may be 
true or false, depending upon whether, 
in fact, occasions of S; are occasions of 
Se. In illusion there is a false predica- 
tion. This arises in Several ways, one of 
which is fallacious inference, though the 
latter is more Prevalent in thinking than 
in perceiving, In an error of omission 
Si is judged to be S, alone when in fact 
Sı is also Ss, S4, etc. Thus we must 
recognise that there are, first of all and 
independently of our knowledge of them, 
the facts. These are the empirical con- 
tent of the term “real” in the traditional 


to be So. 
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distinction. When we judge that S, is 
Se and in fact it is, we have attained a 
knowledge of that fact. When we have 
attained a knowledge of some of the 
infinite facts confronting us and have 
failed, by omission and by commission, 
to attain the knowledge of others, we 
have the state of affairs emphasized by 
the term “phenomenal.” To say that 
we know only the phenomenal is true in 
the trivial sense that we know only what 
we know. But it should not be inter- 
preted as having more significance than 
that. It is demonstrable that, in a 
rough manner of speaking, the moon 
looks smaller than it really is. Further, 
if we assume, as some of our forebears 
did, that the law of visual angle is suf- 
ficient for the explanation of size per- 
ception, it may be said, equally roughly, 
that the moon looks bigger than it ought 
to. To ask, “What is this too small and 
too large thing that we see?” is to ask 
the wrong question. It assumes that 
there is such a too small and too large 
thing somewhere. As it is not out there, 
it is then concluded that in some sense 
it must be in here or somewhere in 
between. The question, “What do we 
see when we see the size of the moon?” 
is answerable properly not with “the 
real moon” (for that gives us no way of 
Coping with the anomaly we have 
noted), nor with “the retinal image of 
the moon” (for that is contradicted by 
experience), nor with “an intermediate 
or a phenomenal object” (for that is a 
Piece of fabrication which leads us into 
Confusion), but with “that the moon 1s 
size X,” or “that the moon is the size 
of X.” That this visual judgment 1s 
false, and that it varies with the position 
of the eyes in the head, are matters of 
reat interest to the psychologist—but 
Not because he studies phenomenal 
Moons, leaving the astronomer to take 
Care of the real ones. 

Frequently sense perception is re- 


garded as having the particular and the 
concrete as its content whereas thought, 
apart from its purely reproductive forms, 
is regarded as being characteristically 
concerned with the universal, the con- 
ceptual, and the abstract. The psy- 
chologists prone to make this distinction 
seem to have forgotten the long-stand- 
ing arguments about the nature of uni- 
versals, and not to have learned that 
the scholastic positions known as real- 
ism, nominalism, and conceptualism do 
not exhaust the possibilities. Discrimi- 
nation theorists, when dealing with cog- 
nitive matters other than perception, are 
prone to resort to a sort of nominalism: 
knowledge of universals is merely mak- 
ing the same response, a symbolic re- 
sponse, to several stimuli. Phenome- 
nalists of the elementarist kind tend to 
follow a similar line, substituting such 
phenomena as composite images. Phe- 
nomenologists in psychology turn regu- 
larly to conceptualism, Goldstein and 
Scheerer (18) affording an example 
singularly rich with scholastic vestiges. 
Bruner, in his theorizing about cate- 
gories, reveals an inclination towards 
conceptualism. Nevertheless, his dis- 
cussion has the merit of showing that 
perception is not confined to the par- 
ticular, and that a concern with the uni- 
versal is as characteristic of perception 
as it is of thought. A clearer recogni- 
tion that all cognition is judgmental 
would lead to a rejection of both nomi- 
nalism and conceptualism without forc- 
ing one into the mystification of medi- 
eval realism. What distinguishes the 
universal is that it is predicable of many 
things, a point recognized by Bruner in 
respect of his categories. However, this 
is the case with anything that is predic- 
able, even when it is a so-called con- 
crete particular. Some psychologists 
seem to think that there are some logical 
considerations requiring a special class 
of “things” called concepts. The nomi- 
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nalists know that there are no such con- 
siderations, but the psychologists among 
them fail to see that speaking of stimu- 
lus and response generalization does not 
take them far enough in accounting for 
knowledge. It is not that we confuse 
one member with another of its class, 
but that we know each of them to be 
things of the same sort. 

The explication given here of the term 
“hypothesis” (or “category”) reveals a 
diffculty in giving a qualitative account 
of hypothesis-strength (or category ac- 
cessibility) which has an important 
place in the theory. Though the opera- 

tional criteria provided especially by 
Postman suffice for the mounting of an 
extensive and fruitful experimental pro- 
gram, they are not adequate for a full 
explanatory use of the term. An hy- 
pothesis itself cannot vary in strength; 
Sı is simply S, but never more or less 
S2. No ground is gained by saying that 
the hypothesis is held more strongly un- 
less it can be said what holding an hy- 
pothesis is. To say that the hypothesis 
is used more or less frequently in certain 
situations does not explain what hypoth- 
esis-strength was meant to explain. The 
same strictures may be made with re- 
spect to habit-strength in Hullian theory, 
Should habits, now defined disposition- 
ally, turn out to be neural structures, 
then habit-strength could be some vari- 
able property of those structures. A 
judgment may have a similar neural 
determinagt, in which case hypothesis- 
strength could be given Positive mean- 
ing. This is a hope, however, with little 
factual backing at the moment, 

The information and test Phases in 
the cycle also need some clarification, 
At first sight the information Phase is a 
good deal like what we have all been 
calling perception. Were further con- 
sideration to confirm this impression, 
then this theory would be not so much 
an account of perception itself as of the 


wide context of processes in which it is 
embedded. Further, too much stress is 
placed by Bruner and by Postman on 
stimulation as such, that is, the impact 
of stimulus energy upon the receptor 
mechanism. It appears important to 
recall at this point the distinction made 
by Harper and Boring (21) between 
cues and clues. What is referred to as 
information should on some occasions 
be recognized as a nonperceived but 
sensorily mediated determinant of per- 
ception, and on others as a hypothesis 
(judged state of affairs) relevant through 
inferential processes to the hypothesis 
under test. The former type of infor- 
mation is a cue (which is all that is con- 
sidered in cybernetic theory), whereas 
the latter is a clue which may be a 
Premise as well as a mere determinant. 
It may well be that a similar distinction 
should be made in respect of the test 
Phase of the cycle, yielding something 
like reinforcement in that brand of 
learning theory with which discrimina- 
tion theories are most readily affiliated, 
and something like inference in Tol- 
man’s theory, which: js at least im- 
plicitly judgmental, The strengthening 
of an hypothesis is akin to acquisition in 
learning, and the weakening of an hy- 
pothesis is akin to inhibition and ex- 
tinction. 
The distinction between cue and clue 
may well help with the resolution of the 
Present paradox of siz 
ception. The bidime 
of itself encompass 
and projected shape, 
should thus be depend 
information about dis 
tion of shape upon 
tion about inclinatio 
amount of eyj 
former of these 


e and shape per- 
nsional retina can 
only visual angle 
Perception of size 
ent upon access to 
tance, and percep- 
access to informa- 
n. There is a fair 
dence to support the 
suggestions, though little 
available that is relevant to the latter- 
Several theorists, e.g., Koffka (27) and 
Boring (7), have been led to suppose 
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that there must be an invariant relation, 
to put it in Boring’s way, between, in 
the one case, perceived size, perceived 
distance, and retinal image size. Now 
an experiment like that of Holway and 
Boring (24) does not bear on this al- 
leged relation, for no distance judgment 
was obtained. An experiment like that 
of Gruber (20) does bear on it; that it 
falsifies the alleged relationship is 
ameliorated only by its inherent design 
defects. Stavrianos (43) failed, despite 
a commendable persistence, to show 
that errors in perceived shape were re- 
lated to perceived inclination. Her ex- 
periment, too, has some basic design 
defects. Nixon (38), in a series of stud- 
ies free of these limitations, has falsi- 
fied the hypothesized relationship. The 
puzzle here would be resolved were it to 
turn out that this is an “overdeter- 
mined” situation in the sense that there 
is more information, not all of which is 
congruent, than the minimum required 
for a given judgment of size or shape. 
Some of this information is in the form 
of a judgment of distance or of inclina- 
tion. This is a clue to size or to shape 
when it is conjoined with information 
such as retinal size or retinal shape. It 
may operate as a clue through some in- 
ferential process, as Helmholtz would 
have maintained. In addition, there 
may be cues to distance (inclination is 
a matter of relative distances) which on 
this occasion do not influence the judg- 
ment of distance or of inclination, but 
which may influence the judgment of 
size or of shape. If ocular torsion 
is relevant to the moon illusion, it 
would be just such a cue: the paradox 
that Gruber leaves was left also by 
Boring in respect of that illusion, 
namely, that the changed apparent size 
of the moon is due to a changed cue to 
distance, even though the apparent dis- 
tance has been inversely affected. The 
way a cue affects a judgment would not 


be through some inferential process. 
Inference seems peculiar to discourse or 
to knowledge. It is not merely a matter 
of cause and effect. When we speak of 
cues, no more than a causal relation is 
being referred to. The nearness of the 
horizon moon is possibly the result 
(through inference) of its appearing so 
large. 

Were a considerably modified notion 
of response developed, one able to em- 
brace what Woodworth (49) called a 
“mental” response as well as what he 
called a “motor” response, the proposal 
made by Berlyne (4) would have 
greater merit. As was pointed out 
above, a pure stimulus act such as he 
had in mind will not suffice. Nor, of 
course, is it enough to say that the per- 
ceptual response is to be characterized 
as discriminative. Little responding, 
whether cognitive or otherwise, is in- 
discriminate. The kind of response 
needed is judging that S, is Se. To fit 
such a concept on to Hullian theory re- 
quires rather more general renovations 
than Berlyne suggests. However, if it 
can be done, we should have, as he is 
hoping to have, the best of the two 
worlds of discrimination theory and phe- 
nomenalism in coping with perceptual 
problems. 
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The purpose of this paper is to at- 
tempt a theoretical formulation of atti- 
tude development from a concept-forma- 
tion point of view. Mediating responses 
have been employed by Osgood (21) to 
explain concept formation, and a medi- 
ating or implicit response has also been 
used as a main theoretical construct in 
Doob’s (8) outline of attitude theory. 
To this extent, there is already an estab- 
lished theoretical link between concepts 
and attitudes. 

Doob calls attitude “an implicit drive 
producing response considered socially 
significant in the individual’s society” 
(8, p. 136). This viewpoint has been 
criticized by Chein for a number of rea- 
sons, including the fact that it does not 
clearly indicate the evaluative nature of 


ould rather call at- 


agreement is not 
20) review of the 
more or less dif- 
titude, and more 
d expand the list 


: definition seems 
clearly superior solely on logical grounds; 


if it were otherwise, there would not be 
so many alternatives, 

The various definitions n 
structs, and presumably some constructs 
have more scientific value than others. 
The psychologist, seeking a guide for 
his research among many alternatives, 
looks for a construct with heuristic and 
predictive value, particularly a con- 
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struct which leads to informative re- 
search. Insofar as attitude change is 
concerned, Doob’s type of approach has 
stimulated considerable research. But 
in the area of attitude development, 
very little carefully controlled experi- 
mentation has been done. Besides the 
research of Eisman (9), there is little 
experimentation available on attitude 


acquisition, although studies such as 
those by Horowitz 


tude devel- 
lled experi- 
pt-formation 
formation approach 
also enables any 
which can account 
n to account at the 


same time for attitude formation. 


THE GENERAL MEANING oF AN 
ÅTTITUDE 


imes thought of as 
hanism that repre- 


Simi- 
a person who speaks 
Y has dark skin, thick 
lips, and kinky hair” classifies Ameri- 
can Negroes for Some people. Of course, 
the concepts learned in real life are not 
always verbalizeq into a precise prin- 
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ciple. In fact, behavior in laboratory 
studies (19) indicates that a concept 
can be learned even though S is unaware 
of it and is unable to verbalize the prin- 
ciple on which the concept is based. 
The same phenomenon probably occurs 
with attitudes. People often betray by 
their behavior attitudes which they do 
not recognize as their own. 

On a common-sense level, concepts 
are developed over a period of time 
through a series of experiences. The 
child learns the concept cow because his 
parents and others point to objects in 
the field or in pictures and call them by 
that name. He learns that color alone 
is not the key to cow-ness, but that it is 
helpful. All black objects are not called 
cow, but green objects are never called 
by that name. The child soon learns to 
discriminate between cows and other ob- 
jects as he learns that cow-ness is a 
feature of not one stimulus pattern but 
of a whole class of patterns. 

The child learns about people in 
much the same manner. He discovers 
that one dimension of Negro-ness is 
dark skin, but that a deep suntan does 
not make one a Negro. He gradually 
learns that Negro-ness involves other 
dimensions such as facial characteristics 
and hair properties. This learning 
eventually enables him to make fairly 
accurate discriminations between Ne- 
groes and other persons. So far, the 
child has a concept of a Negro but not 
an attitude, because an attitude involves 
an evaluative dimension. Suppose, how- 
ever, he has some unpleasant persona 
experiences with Negroes or receives 
adverse information about them from 
his parents. Then the child’s concept 
might be based on the dimensions, “dark 
skin, kinky hair, thick lips, speaks with 
a drawl, and bad.” Since an evaluative 


dimension now appears in the concept, 


he has acquired an attitude. 
This approach to attitudes is not 
arbitrary substitute for other meanings 
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of attitude, but is representative of com- 
monly used meanings, as may be seen 
from the following quotations: 

“Most researchers in attitudes agree 
that one of the functions of attitudes is 
to integrate social perceptions and ex- 
periences. Thus our experiences with 
our mothers, sisters, girl friends and 
maiden aunts are integrated . . . into an 
attitude toward women” (16, p. 292). 
“, . . attitude [is] a more or less perma- 
nently enduring state of readiness of 
mental organization which predisposes 
an individual to react in a characteristic 
way to any object or situation with 
which it is related” (1, p. 13). “The 
attitude is a way of conceiving an ob- 
ject; it is the mental counterpart of an 
object” (10, p. 11). “... the meaning 
of a concept is its location in a space 
defined by some number of factors or 
dimensions, and attitude toward a con- 
cept is its projection onto one of these 
dimensions defined as ‘evaluative’ ” (22, 
p. 42). “An attitude, roughly, is a 
residuum of experience, by which fur- 
ther activity is conditioned and con- 
trolled . . . an inner mental organization 
takes place which predisposes the person 
to a certain type of activity towards ob- 
jects, persons, and situations” (17, p. 
238). 

The thread which runs through these 
quotations—“experiences integrated into 
an attitude,” “mental organization,” 
“mental counterpart,” “a way of con- 
ceiving”—is that an attitude is a kind 
of mental organization that is built 
through many experiences. In a sense, 
as a result of varied contacts with a class 
of objects, a concentrated representation 
of a set of stimuli is left within the per- 
son, which is the essence of what is gen- 


erally meant by a concept. 


An EXPLANATION OF ATTITUDE 


If an attitude is taken to be a concept 
with an evaluative dimension, its mean- 
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ing may be derived from an explanation 
of concept formation. Consequently, 
any theory which can explain concept 
formation can also help to explain at- 
titude development. One of several pos- 
sible explanations of concept formation 
is provided by S-R Theory. 

Borrowing from Osgood (21), a con- 
cept may be defined as the associations 
between a common response (often ver- 
bal) and a set of stimuli. These stimuli 
frequently comprise a class of phenom- 
ena which display certain common char- 
acteristics, The common characteristic 
of the set of phenomena may be the 
identical elements with which Hull (14) 
worked, such as a common shape em- 
bedded in each of several different Chi- 
nese characters, 
has demonstrated, 
perceptual relations when, for instance, 
Ss view a series of different figures and 
classify together all those h 


The mediating response, Hull’s pure 
u 


stim be any fraction of a 
total response Pattern, Partly as a re- 


apacity to dis- 


an organism, the m and implici 
Impl 
can become the ‘detachable? Teac} Bore 


included in the stable mediation process. The 
higher the organism in the evolutionary scale, 
the finer the discriminations it can usually 
make and the less gross its representing proc- 
esses. Similarly, the more mature and intelli- 
gent the human individual, the less overt his 


nations that characterize language behavior are 


Nature’s farthest step in this direction (21, 
P. 398). 


Of all possible human mediators, 
Probably the Most significant are verbal 
mediators, Research by Cofer and 
Foley and associates (5, 6, 7, 12), Riess 
(28, 29), and others demonstrates the 
significance for language behavior of 
mediating Tesponses. Other investiga- 
tions by Reed (23, 24, 25), Kendler and 
by Goss and asso- 
the importance of 


become organized 
It is this system of 
n the stimuli and the 
Process which is the 


into one system, 


concept, the stimuli 
Produced by the first-order mediators 
are themselyes associated with another 
common mediator. One set of first- 
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A. FIRST-ORDER B. SECOND-ORDER 
CONCEPT CONCEPT 


Fic. 1. A stimulus-response explanation of concepts and attitude. S,= evaluative stimuli; 
m, =a first-order mediator; fı =a first-order evaluative mediator; rs=a second-order medi- 


C. ATTITUDE 


ator; s=the stimulus produced by a mediator. 


order stimuli in Part B of Fig. 1 might 
be different skin shades leading to the 
mediator representing dark skin; the 
second set of first-order stimuli might 
be various lip thicknesses which are as- 
sociated with the mediator for thick 
lips; and each of the stimuli from these 
mediating responses may be associated 
with the mediator for Negro (rə). This 
illustrates a second-order concept, but 
there is no attitude as yet because no 
evaluation is involved. 

There is an attitude when the medi- 
ator of at least one of the first-order 
concepts is an evaluative reaction. Thus, 
attitudes are equivalent to that special 
class of concepts which is distinguished 
by the inclusion of an evaluative dimen- 
sion. Part C of Fig. 1 is a simplified 
illustration of an attitude. Suppose, as 
in the example of a second-order con- 
cept, that one 7; of this figure is the 
mediator for thick lips, that the second 


Fic. 2. A: Major associations between the attitude 
Fig. 1), and verbal (rv) and affective (rz) mediators. 


Prejudice through displaced aggression. 


rı represents dark skin, and that ro .is 
the mediator for Negro. Suppose also 
that the stimuli (S,) of the evaluative 
dimension are dirty, stupid, and rude 
and that 7s, the evaluative reaction, is 
bad. Now the stimuli associated with 
Negro are dark skin, thick lips, and bad. 
The inclusion of an evaluative dimension 
identifies Part C of Fig. 1 as an attitude. 

The mediating response produces a 
stimulus which may then become con- 
ditioned to other mediating or instru- 
mental reactions. Part A of Fig. 2 
shows some of the major associations 
possible between the attitude and these 
other mediating responses. It indicates 
how instrumental responses may depend 
upon a chain of associations involving 
two major types of mediators, namely, 
affective and verbal. Part A of Fig. 2 is 
not meant to imply that in any given 
instance all the illustrated associations 
between the attitude and verbal or affec- 
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(At = the attitude as shown in Part A of 
B: A stimulus-response explanation of 
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tive mediators will be formed. Instead, 
it indicates that one or more of the as- 
sociations may be learned by a given 
individual. For instance, as is shown in 
Part B-1 of Fig. 2, only some of the 
relations between the attitude, affect, 
and the instrumental response may be 
learned in an individual case. 

One implication of Fig. 2 is particu- 
larly interesting. A reaction conditioned 
to the attitude may be aroused by a 
stimulus which is associated with affect 
but not with the attitude. This situa- 
tion suggests an explanation for preju- 
diced responses through displaced ag- 
gression. Suppose an individual has a 
negative attitude towards Negroes. Sup- 
pose further that the affect of Part B 
of Fig. 2 is the mediator for anger and 
the instrumental response is aggression 
toward Negroes. Assume also, as Part 
B-1 of Fig. 2 indicates, that the attitude 

leads to the simultaneous arousal of 
anger and of aggression against Ne- 
groes. Eventually, th 


g rough condition- 
108, anger alone can lead to aggression. 
Now, as indicated by Part B-2 of Fig. 2, 


any stimulus (Sn) which arouses anger 
will also have a tendenc 


‘oes, depending 


l ch this stimulus 
1S supported by other stimuli. 


ency to displace 
aggression from Sn, the real source of 
anger, to Negroes, 

Other specific Predictions follow from 
the general relations shown in Part A of 
Fig. 2. The relationship between verbal 
mediating responses ind affect suggests 
one reason why attitudes are commonly 
accompanied by affective reactions. If 
a given verbal mediator, through Past 
experience, has become associated with 
affect, then the arousal of this mediator 
by an attitude will also lead to an affec- 
tive response: the attitude leads to the 

verbal mediator and the verbal Mediator 
is associated with affect. Suppose, for 
instance, that an attitude supplies the 
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conditioned stimulus which evokes the 
verbal mediator for green vomit. Then, 
insofar as this mediator evokes an emo- 
tional reaction, the attitude will be ac- 
companied by affect. This particular 
verbal mediator was used as an example 
because in some cases it will make the 
illustration more vivid by actually arous- 
ing an emotional reaction in the reader. 

There is probably a more important 
Teason why attitudes are commonly 
linked to affect. It is quite often true 
that the experiences through which 
the attitude is developed occur in a 
charged atmosphere. In this case, the 
mediating response of the attitude is 
continuously aroused in conjunction with 
an emotional reaction, and the stimulus 
Produced by the mediating response be- 
comes a conditioned stimulus capable of 
evoking affect. As a result, the expres- 


Sion of this attitude is accompanied by 
an emotional response. 


A METHOD For STUDYING ATTITUDE 
DEVELOPMENT 
One of the main 


} values of the con- 
cept-formation approach to attitude is 
that it Suggest: 


S a simple laboratory 
method for studying attitude develop- 
ment. If an attitude js a kind of con- 
cept, then its acquisition may be in- 
vestigated by the methods of concept 
formation. Such a method, depending 
on an evaluative dimension in the con- 
cept, has been employed by the writer. 
In order to Construct an evaluative 
dimension, 100 Ss Were asked to rate 4 
large number of descriptive traits in 
terms of their desirability in a person. 
Some example traits are: ambitious, bor- 
ing, calm, intelligent, obedient, and vain. 
All traits showing less than 90% agree- 
ment as to their desirability were dis- 
carded. The Temaining traits can be 
employed to Study the development of 
an attitude and to measure its strength 
and resistance to change. To do this, 2 
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group of Ss is told that anthropologists, 
missionaries and physicians have visited 
a group of people about which little is 
known, and that all the visitors agree on 
certain characteristics of this group of 
people. When these instructions are 
concluded, Ss are shown a number of 
traits, one by one, and are asked to 
predict whether each trait is or is not 
characteristic of the people. After each 
prediction they are told whether or not 
the word is indeed characteristic. For 
example, to develop a positive attitude, 
E would say “characteristic” after pre- 
senting each trait previously rated as 
desirable and “not characteristic” for 
words previously rated undesirable. 

If S has learned the positive attitude, 
which might be labeled “pro-ness,” he 
would be expected to respond accord- 
ingly. Hence, when he has completed 
the first list of traits, S is asked to con- 
tinue responding in the previous manner 
to a second list of words without being 
told which traits are characteristic or 
not characteristic. Responses under 
this condition are taken as a measure of 
attitude strength. It is assumed that a 
strongly acquired, positive attitude will 
lead to a high proportion of positively 
rated traits being called “characteristic” 
and to most negatively rated traits being 
called “not characteristic.” 

Finally, resistance to attitude change 
is measured by using a third list of 
traits with E again saying “character- 
istic” or “not characteristic” after each 
Prediction. But this time Z’s responses 
are reversed. For example, if S has 
Previously acquired a positive attitude, 
change may be accomplished by calling 
“not characteristic” all positive traits in 
the third list and “characteristic” all 
negative traits. 

This method is derived from the con- 
Cept-formation approach to attitudes. 
The meaning of the many traits seen in 
a single session varies considerably, as 


can be observed from the examples given 
above. Although the traits are not 
synonyms, they do exhibit one common 
aspect, the evaluative element. The Ss 
are able to respond correctly when this 
evaluative element is consistently rein- 
forced by E because the traits are stim- 
uli which are capable of arousing a com- 
mon, evaluative mediating response. 
The mediating process, in turn, deter- 
mines S’s further predictions of ‘“‘char- 
acteristic” or “not characteristic.” 

The stimulus function of mediating 
responses in attitude development is 
demonstrated by two experiments which 
also illustrate the mediating response 
approach as applied through the con- 
cept-attitude method. A negative at- 
titude was developed in both investiga- 
tions. In one study (27), consistency 
of reinforcement was defined as the sum 
of the number of positive traits called 
“not characteristic” by Æ and the num- 
ber of negative traits called “character- 
istic,” divided by the total number of 
traits used during attitude development. 
Using five levels of consistency, it was 
shown that both attitude development 
and strength increase as a function of 
consistency of reinforcement, as would 
be predicted from the approach to at- 
titude development based on mediating 
responses. The results for attitude 
change also suggest that attitudes ex- 
hibit properties of mediating responses. 
Research on acquired distinctiveness of 
cues (18) indicates that strong habits 
reverse more easily than weak ones 
when the cues arousing them have be- 
come distinct through learning. The 
distinctiveness of the stimulus dimen- 
sion is believed to be related to the de- 
gree to which the stimuli are conditioned 
to a mediating process. The evaluative 
element of the stimulus traits should be 
most distinct when reinforcement is 
fully consistent; consequently, attitude 
change should be easiest under this con- 
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dition. The Ss receiving the greatest 
consistency of reinforcement did in fact 
show least resistance to change. 

A second experiment (26) illustrating 
the concept-attitude method investigated 
the effect on attitude development of peer 
responses. The traits were used with 
100% and 84% Consistency of rein- 
forcement to develop a negative atti- 
tude. Some Ss made their predictions 
after first hearing the responses of three 

fellow students who were E’s confed- 
erates, and other Ss made their predic- 
tions in private. The confederates’ re- 
sponses prior to §’s predictions were 
always or almost always in contradiction 
with the statement Æ made after each 
prediction. Thus, when the confederates 
said a trait was characteristic, it was 
Pretty certain that E would say the 


The Ss 


Apparently, peer responses Provided cues 
which were an aid to Si 


attitude. This seemed to occur 


mmon, evalua- 
With 100% 
t, a negative 
sed by a trait 
t and in con- 
junction with peer responses of “char- 
acteristic.” Similarly, a positive medi- 
ating reaction is aroused in conjunction 
with peer responses of “not character- 
istic.” In time, as a result of condition- 


ing, the peer responses can be expected 
to help cue the appropriate positive or 
negative mediating response. With 84% 
consistency of reinforcement, the trend 
of conditioning is the same, but a minor- 
ity of trials tend to establish competing, 
mediating responses: the same peer re- 
actions are sometimes associated with 
positive mediating responses and some- 
times with negative. Under these condi- 
tions, peer responses are a less stable 
and less predictive aspect of the stimu- 
lus environment. This would account 
for the statistically significant effect 
from peer responses with high consist- 
ency of reinforcement, which is dimin- 
ished with lesser consistency, 

Staats and Staats (31) have also 
i itudes using an approach 
5 mployed in the preced- 
mg two studies. The associated given 
» and nationalities, 
h positive or nega- 
Subsequent rat- 


the names (or 


tive scale. 

The studies deri 
theory of attitude 
bility of bringing 
under laborat 
generally r 
tude devel] 


ved from the concept 
S illustrate the possi- 
attitude development 
ory scrutiny. Despite the 
ecognized significance of atti- 
l lopment, as distinguished from 
attitude ¢ ange, the literature reveals 
relatively little laboratory evidence bear- 
img Upon its nature. Tt is hoped that 
the concept theory of attitudes and the 
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method it implies will help provide a 
means through which attitude develop- 
ment will be more accessible to experi- 
mental investigations. 


SUMMARY 


A theory of attitude development 
based upon concept formation was dis- 
cussed, and similarities between this 
view and earlier meanings of attitude 
were pointed out. An attitude was de- 
fined as a concept with an evaluative 
dimension, and the mediating response 
was employed to explain attitude learn- 
ing. A method for inducing an attitude 
in the laboratory was described along 
with illustrative studies employing this 
method. 
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SYMPTOMS AND SYMPTOM SUBSTITUTION 


AUBREY J. YATES 
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In recent years there has been a num- 
ber of attempts to derive rational meth- 
ods of symptomatic treatment of the 
behavioral disorders, based mainly on 
learning theory (10, 12, 13, 15, 18, 19) 
and information theory (2). Neverthe- 
less, symptomatic treatment has not 
been favorably considered by most clini- 
cal psychologists, and the failure to ap- 
preciate its possibilities is perhaps best 
shown by the almost total neglect of 
Dunlap’s classic work (3) on habit-for- 
mation and the remedial treatment of 
undesirable habits. The object of this 
paper is to point out some misconcep- 
tions on which objections to sympto- 
matic treatment are based, and to show 
that such treatment can in fact be firmly 
founded on modern learning theory. 

The argument that symptomatic treat- 
ment is a waste of time has had as its 
strongest proponent the psychoanalytic 
school, which has insisted that symp- 
toms are but the surface indicators of 
underlying conflicts, anxieties, etc. This 
argument has been accepted by most 
“dynamic” psychologists, who concluded 
that removing the symptom without also 
effectively treating the anxiety or Con- 
flict underlying it would lead to symp- 
tom substitution. For example, even if 
tics could be treated directly and extin- 
guished, the patient would quickly de- 
velop either new tics or some other form 
of symptomatic response. Even ortho- 
dox psychiatrists who have little pa- 
tience with Freudian psychodynamics 
have accepted this point of view. Thus 
Kanner writes that “symptomatic ther- 
apy which does not care what the com- 
plaint is a ‘symptom’ of is rarely suc- 
cessful and leads to unwarranted pes- 
simism” (9, p. 244). Mowrer has 


summed up the position neatly and abu- 
sively: 

. .. the behavioral manifestations which we 
now call “symptoms” are not the essence of 
neurosis and . . . the modes of treatment that 
are aimed at their direct alleviation are as ill- 
considered theoretically as they are futile prac- 
tically (14, p. 620). 


This point of view has become so 
widely accepted that until recently al- 
most no symptomatic treatment was car- 
ried out, except in the educational field 
of remedial teaching and speech ther- 
apy. However, the distinction between 
a “fundamental” underlying anxiety and 
a “superficial” surface symptom is open 
to a number of objections. 

1. The term “symptom” has been em- 
ployed very loosely in psychology. In 
general medicine a clear distinction is 
made “between symptoms, that is dis- 
orders described by the patient, and 
physical signs or the deviations from 
the normal revealed by examination” 
(7, p. 1). In other words, a symptom 
is a subjectively experienced abnormal- 
ity, a sign is an objective indication of 
abnormality. In many cases the symp- 
tom may, of course, also be a sign. 

Mowrer, having accepted, as we have 
seen, Freud’s conclusion with regard to 
the meaning and function of symptoms, 
was considerably embarrassed by his 
own empirical finding that “sympto- 
matic” treatment of enuresis was not 
only 100% successful with regard to 
the symptoms, but was not followed by 
symptom substitution in a single case! 
Instead, however, of rejecting Freud’s 
hypothesis, Mowrer took the extraordi- 
nary step of denying that enuresis was 
a symptom in any of the cases he 
treated. He distinguished between habits 
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which were also symptoms, and habits 
which were not. Thus, 


t, s” differ from ordinary “habits” pri- 
nei e that the former are motivated by 
and are perpetuated because they lessen the 
drive of anxiety, whereas “normal” habits are 
motivated by and are perpetuated because 
they lessen other drives, such as hunger, thirst, 
cold, fatigue, sex, fear, etc. (14, p. 546). 


Now the above distinction is clearly a 
meaningful and possibly a valid one. 
As far as the author is aware, however, 
Mowrer has not presented any evidence 
that the distinction can be sustained in 
the form in which he has put it, and his 
own experimental results seem strongly 
to contraindicate the validity of any 
such distinction, Mowrer’s acrobatics 
in this regard are as good an example 
as any of an a priori clinical bias being 
allowed to outweigh cogent experimental 
evidence, 

It is clear that, as used by Freud (6), 
Mowrer (14), and other “dynamic” psy- 
chologists, the symptom is regarded as 
a response to some hidden, basic con- 
flict or anxiety. It will be suggested 
that this “vertical,” dualistic approach 
in unnecessary and should be replaced 


by a “horizontal,” monistic approach, 
as described below, 


a predisposition exists, th 
that a neurosis is symypto 
ing else, i.e., that a parti 
consists of a particular 
responses; and that trea 


en it follows 
ms and noth- 
cular neurosis 
set of learned 
tment on psy- 


1Mowrer admits that it js impossible to 
“know from the incontinence alone that it js 
a symptom rather than simply a reflection of 
pedagogical inadequacy” (14, P. 414). Pre. 
sumably, therefore, decision as to whether the 
incontinence is a symptom or a habit can be 
determined only by carrying out “sympto- 
matic” treatment and seeing what happens. 


chological grounds can only be sympto- 
matic, since treatment of the predispo- 
sition must ultimately be by genetic or 
chemical means. 

3. Finally, it should be noted that be- 
lief in symptom substitution seems to 
be based largely on clinical experience. 
Considering the significant role such a 
distinction has played in clinical psy- 
chology, experimental demonstration of 
its existence is singularly lacking. We 
conclude, therefore, that the reasons 
generally given for rejecting sympto- 
matic treatment in favor of “dynamic” 
treatment are not soundly based. Nev- 
ertheless, the definition of a neurosis as 
a system of learned responses is not in 
itself incompatible with the notion of 
symptom substitution, We must now 
turn to the question whether there are 
any theoretical reasons to suppose that 
symptom substitution would take place, 
if symptomatic treatment were to be un- 
dertaken. It will be maintained that 
learning theory Suggests both that symp- 
tomatic treatment is a rational pro- 
cedure and that symptom substitution 
would not be expected except under cer- 
tain specifiable conditions, 

his may Perhaps be most easily 
shown by reference to some recent work 
on the Symptomatic treatment of tics 


(cf. 19 for details). A model was con- 


It was hypothesized that 
the tic is a Conditioned avoidance re- 
sponse to anxiety, the latter being also 
a learned response (the conditioned form 


fear response). A 
ent was derived in 
n concepts of Ir and 
of massed practice.” 
is theory, the follow- 

*In this study (19) it was found convenient 
to use the Concepts of Ir and sIr in deriving 
a method of treatment, This aspect could 
also, however, haye been conceptualized in 
terms of two-factor learning theory, as Mowrer 
has shown in Connection with studies in the 


field of general motor behavior (14, p. 169 
Footnote 4), 
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ing points are specifically relevant to 
this discussion: 

a. Both the anxiety and the tic are 
learned responses. Now it would be a 
logical error to suppose that the fact 
that the tic is a response ¢o the anxiety 
necessarily means that the anxiety must 
therefore be more fundamental or that 
it must receive prior treatment. It 
seems more reasonable to regard the 
situation simply as one in which two 
sets of responses (anxiety and the re- 
sponse to anxiety) are learned over 
time. This may be termed the “hori- 
zontal” approach, as opposed to the 
dualistic “vertical” approach rejected 
above. 

b. From this point of view it does not 
much matter which aspect of behavior 
(anxiety or tic) is treated first. The 
fact is, however, that in our present 
state of knowledge it is easier to treat 
the tic than the anxiety. The former 
is more readily manipulated and meas- 
ured, and, since it usually involves mo- 
tor behavior, is easily dealt with in re- 
lation to the concepts of learning the- 
ory. It turns out, however, that in 
treating the symptom we are also treat- 
ing the anxiety. For suppose the symp- 
tom is destroyed; then the anxiety, in- 
stead of being reduced by it, will per- 
sist. But since in fact no traumatic 
event occurs if the anxiety is allowed to 
persist, the latter, being itself a learned 
response, will begin to extinguish. This 
is what in fact seems to happen, though 
the principle of partial irreversibility of 
traumatic anxiety (17), if valid, sug- 
gests that extinction of the anxiety 
would never be complete.* 

This formulation is, of course, derived 
directly from Mowrer’s two-factor the- 


® The nature of traumatic avoidance learn- 
ing is a very complex problem, which has 
been greatly oversimplified here. The reader 
Is referred to recent papers by Solomon and 
Brush (16) and by Solomon and Wynne (17) 
for comprehensive discussions of the current 
Position. 


ory of learning. That removal of the 
symptom will affect the learned anxiety 
is hinted at by Mowrer, when he ex- 
plains why a symptom 


is at one and the same time both self-per- 
petuating and self-defeating. It is self-per- 
petuating because it is reinforced by the satis- 
faction provided through the resultant anxiety 
reduction; and it is self-destructive in that it 
prevents the individual from experiencing the 
full force of his anxiety and being modified by 
it in such a direction as to eliminate the oc- 
casion for the anxiety (14, p. 535, Footnote 
3; italics not in original). 


Supposing, therefore, both the symptom 
and the anxiety to be alleviated, we 
would be left with a basically neurotic 
person without a current behavior dis- 
order. But, in terms of learning theory, 
there would be no reason to suppose 
that this person would develop new anx- 
ieties or symptoms unless put into a 
new traumatic situation. This would, 
of course, involve knowledge of what 
situations would be likely to be trau- 
matic for a particular patient.* 

Such empirical evidence as there is 
supports the position outlined above. 
Symptom substitution has not been 
found in those instances where a single 
patient has been subjected to intensive 
investigation and symptomatic treat- 
ment (e.g., 8, 12, 19); nor in those few 
instances where large-scale follow-up has 
been carried out on groups of subjects. 
For example, two recent investigations 
(1, 11) of the results of symptomatic 
(pace Mowrer) treatment of enuresis 
indicate 70%-80% success, and a fail- 
ure to find symptom substitution over 
periods up to two years. Mowrer him- 
self (13) reported 100% success. The 
critical point seems to be that we are 
faced in such cases with a vicious circle 
(as Mowrer has indeed pointed out): 
the tic, for example, is a response to 


€In the case of the patient treated by the 
author, for example, any situation involving 
anaesthesia (such as dental or medical opera- 
tions) would be potentially traumatic. 
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anxiety, but performance of the tic in- 
creases the general level of drive (irrita- 
bility, annoyance, etc.), which increases 
the rate of responding, and so on. The 
important factor seems to be the break- 
ing of this vicious circle; it does not 
much matter whether this is accom- 
plished by treating the anxiety or the 
symptom, since both react upon each 
other. Thus, in the shadowing method 
described by Cherry and Sayers (2) for 
the treatment of stammering, the mere 
fact of demonstrating to the patient that 
he can speak normally (even though un- 
der special conditions and before treat- 
ment has started) usually has a very 
striking effect on the patient’s general 
well-being. In Many instances it seems 
that the symptom itself generates a good 
deal of anxiety, which can be alleviated 
by a direct attack on the symptom. 
The seriousness of the Present situa- 
tion is clearly demonstrated by Ey- 
senck’s recent contention (5, p. 271) 
that, in spite of the experimental valida- 
tion of symptomatic treatment of enure- 
sis, this technique is not used in any 
child guidance clinic in Great Britain, 
It is obvious that a Teorientation js 
highly desirable, in which the “Vertical” 
approach is abandoned, the use of the 
term “symptom” dropped altogether, 
and the patient considered as an indi- 
vidual who has developed a series of 
learned responses to Certain situations, 
Clinical psychologists may then perhaps 
begin to apply that large body of knowl- 
edge and theory to be found in general 
psychology in which they are specially 
trained, and apply the Principles of þe- 
havior to the abnormal field as well as 
to the normal. 
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It is the purpose of this paper to 
bring together several current concepts 
arising from the study of transfer of 
training and to consider their implica- 
tions for the study of perceptual learn- 
ing and recognition. A number of 
studies have indicated an increasing 
emphasis on relations between these two 
areas (1, 2, 3, 12, 13, 21, 22, 27, 38), 
and several theoretical formulations 
have been advanced to account for cer- 
tain phenomena in perceptual learning 
and recognition in terms of existing prin- 
ciples of transfer (12, 21, 26, 37). In 
several instances, however, the reverse 
of this latter kind of account has been 
advanced; that is, certain phenomena 
in transfer of training have been held 
to support theoretical formulations 
grounded upon assumptions involving 
processes of perceptual learning (3, 15, 
16, 17, 20). 

In the first kind of view, recognition 
or discrimination tasks have been em- 
ployed to provide evidence for hypothe- 
ses concerning changes in stimulus-re- 
sponse relations which take place during 
learning. This view is illustrated nicely 
by the study by Arnoult (2). Four 
groups of subjects learned to associate 
verbal labels with nonsense forms. In 
one group the labels were nonsense 
syllables; in a second group the labels 
were girls’ names; in a third group the 


1 This research was supported in part by the 
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AF 41 (657)—47, monitored by the Operator 
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reproduction, translation, publication, use, 2 
disposal in whole or in part by or for the 
United States Government. 


labels were descriptive names given the 
forms by an independent group; in the 
fourth group the labels were names in- 
vented by the subjects themselves. Fol- 
lowing this labeling practice, the sub- 
jects were given a recognition test, in 
which they were required to select from 
a group of forms those for which they 
had learned the labels. Recognition 
was found to increase as a function of 
the degree to which the responses in the 
paired-associates task (learning labels) 
reflected associations made to the forms. 
The implication from this kind of study 
is that recognition, as a perceptual task, 
is dependent (at least in part) upon the 
degree to which responses, in this case 
labeling responses, transfer from the 
first task to the task of recognition. 
Clearly, then, from this point of view, 
the process of recognition is intimately 
related to what may be regarded as 
stimulus-response relations, consistent 
with long-standing associationistic the- 
ories. 

In the second kind of view, the con- 
verse of the above statement is usually 
made; transfer comes about as a func- 
tion of underlying perceptual processes. 
This view finds support in studies of the 
kind made by Gagné and Baker (15). 
In this study, subjects were given pre- 
liminary practice in associating a differ- 
ent letter to each of four different lights. 
Following this practice the subjects en- 
gaged in a second task in which the 
same lights were used as stimuli as were 
used in the first task, but the responses 
were changed to require the subjects to 
manipulate a different switch when each 
of the four lights appeared. The scores 
(time and errors) were found to vary in 
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which was related to the amount 
ara in the first task. The rate 
at which the subjects learned to respond 
by manipulating the correct switch in- 
creased as a function of practice in as- 
sociating labels, and significant differ- 
ences in errors were found as a function 
of the amount of first-task practice. 
These results were interpreted by the in- 
vestigators as supporting the view that 
practice in labeling brought about a re- 
duction in generalization in responses 
among the various intratask stimuli. In 
their view, the practice increased the de- 
gree to which the stimuli were differ- 
entiated from each other. A similar 
point of view, although based on a some- 
what different mechanism, is contained 
in the well known hypothesis of Miller 
and Dollard (33) of acquired distinc- 
tiveness of cues. 

Although these two kinds of views 
have not always been stated explicitly, 
they appear in a variety of theoretical 
formulations (to be examined later in 
this paper) and in a number of studies 
of transfer of training (9, 11, 29, 30), 
stimulus predifferentiation (1, 3, 6, 15, 
16, 20, 38), acquired distinctiveness of 
cues (21, 33, 39), and in studies of per- 
ceptual learning (7, 12, 13, 38). It is 
evident that these views are closely re- 
lated. The first view mentioned above 
implies a close relation between verbal 
and motor responses, qua responses, and 
responses which are presumed to reflect 
or to influence perceptual responses, 
The second view implies that verbal and 
motor responses may be related to stim- 
ulus variables which influence percep- 

tual processes. Thus, we might expect 
that perceptual learning and recognition 
are activities which are understandable 
in terms of transfer of training and, con- 
versely, that transfer of training is de- 
pendent upon (or, in the least, involves) 
processes similar to those encountered 
in studies of perceptual learning and 
recognition. However, as obvious as 


these relations, stated in this broad 
sense, appear at first, several difficulties 
arise almost immediately to plague the 
investigator who attempts to interpret m 
any clear fashion the results of experi- 
ments in these areas. 3 

Taken as a group, the studies cited 
above tend to support a common con- 
viction among psychologists that trans- 
fer is involved in practically all phe- 
nomena beyond those of the simplest 
reflex variety, Conversely, however, 
they also support an equally common 
conviction that perception is involved in 
practically all forms of activity, in one 
way or another. 

But if one examines carefully the 
frameworks of these convictions, one en- 
counters several difficulties, First of all, 
the principles of transfer of training 
themselves do not find unequivocal sup- 
port in experiments designed to test 
them. For example, a frequently cited 
principle of transfer, made most ex- 
plicit in the formulation of Osgood (36), 
is that when responses are different in 
two tasks, transfer will occur from one 
task to the other in a fashion inversely 
related to the degree of similarity of 
the responses in the tasks. Positive 
transfer will occur when the responses 
are similar, and zero or negative trans- 
fer will occur when the responses are 
unrelated or antagonistic. But a direct 
test of this principle, in a 
by Bugelski and Cadwallader (10), in 
which responses were vari 
range, yielded data contradictory to the 
principle. 

Second, specification of the ways in 


which responses, gua responses, in trans- 
fer formulations on 


responses presu; 


tion on the other hand, are related, is 
far from clear. One not only finds per- 
ceptual responses interpreted as instru- 
mental activity dependent upon inferred 
responses of an identification type (26), 
but one also finds instrumental responses 
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interpreted as resulting from processes 
involving changes in perception (20). 
These differences in interpretation of the 
activities only lead to confounding of 
the investigator interested in a clear 
statement of principles. 

Third, the variables which are em- 
ployed to represent the various concepts 
in these views are often subject to in- 
terpretations which support alternative 
views with almost equal likelihood. In 
Arnoult’s experiment, for example (2), 
the response of recognition (selection of 
the correct nonsense form) could be in- 
terpreted as resulting either from reduc- 
tion in generalization or from facilitation 
of the correct response (qua response), 
brought about directly by way of the 
intervening identification response that 
had been made during the practice in 
labeling. A similar interpretation could 
be made of the results of the experiment 
by Gagné (15), cited above. The cor- 
rect switching response could result 
either from reduction in generalization 
(predifferentiation), acquired distinctive- 
ness (33), or from an increase in likeli- 
hood of the correct response due to its 
relation to the response in the first task 
(9, 30). Thus, although recognition 
may be interpreted broadly as resulting 
from one or another of the principles of 
transfer, the difficulties mentioned here 
suggest that it is in good order to Te 
examine some of the hypotheses ge? 
erated from the various studies. 


THE RELATION BETWEEN TRANSFER OF 
TRAINING AND RECOGNITION 


Despite a considerable divergence in 
point of view and method between 
studies of transfer and recognition, they 
involve similar experimental operations.* 


2 Difference in point of view concerning 
similar operations may be illustrated by con- 
sidering a situation in which a person en- 
counters a woman, approaches her, and says, 
“Hello, Mary.” If the woman were not in 
fact named Mary, the response might be re- 


Their similarities may be illustrated by 
considering the following paradigms.’ 


Recognition 


Training or experience is provided by 
engaging the subject in a task in which 
he is asked to observe, label, draw, or 
otherwise respond to a form, a pattern, 
or other stimulus. Several such stimuli 
may be used. Following training, he 
may be asked in a second task to select 
those stimuli used in the first task from 
among a group containing both the old 
and some new stimuli. 

This paradigm has been employed in 
a number of studies designed to test 
hypotheses about effects of predifferenti- 
ation or acquired distinctiveness train- 
ing upon transfer, in both verbal and 
motor tasks (1, 2, 3, 9,12). Ina study 
by Vanderplas and Garvin,* for example, 
training was employed in which sub- 
jects were required to learn nonsense 
labels for a series of nonsense forms. 
Following this labeling practice, a test 
of recognition was employed in which 
the subjects were required to select the 
forms for which labels were learned 
when imbedded in a set of forms con- 
taining both the old forms and several 
similar ones. Several conditions were 
employed, including amount of practice 


garded as a false recognition. But if the 
woman were one of a group of women and if 
the person had the task of learning the names 
of each woman, the response “Hello, Mary” 
might be regarded as a case of positive trans- 
fer, provided that another woman (in a previ- 
ous situation) had been correctly called Mary 
the person. f 
z Tho epera controls necessary in this 
and the following paradigm are not cited here, 
since they are intended to show similarities. 
Controls vary with the phenomena under in- 
vestigation, and serve to — experi- 
nts rather than points o view. 
we report of this research, entitled “Com- 
ion Value, and Practice as 


lexity, Associati 1 c 
Factors in Recognition Following Paired-As- 


sociates Verbal Training,” is now in prepara- 
tion. 
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and certain physical variations in the 
forms. The study, however, illustrates 
the use of the paradigm as a test of the 
hypotheses mentioned above. As an- 
other example, the study by Kurtz and 
Hovland is of interest: in their study 
(27), subjects who verbalized the names 
of common objects were found to recog- 
nize them with greater certainty in a 
second task than a group who did not so 
verbalize. The results thus supported 
the hypothesis that the naming response 
increased the probability of correct rec- 
ognition. For the present purpose, the 
study illustrates once again the point of 
view, mentioned earlier, that recognition 
may be influenced by the relation that 
it bears to responses made in the ini- 
tial task. The paradigm, as described 
above, has also been employed in studies 
of concept formation (8) and in the 
more classical experiments on form and 
pattern perception (40, Ch. 23). In 
studies where this paradigm has been 
employed, it is usually inferred that the 
response of recognition is dependent 
upon its relation to responses (most 
often of an identification type) made 
during practice on an initial task, and 


thus closely related to existing principles 
of transfer of training. 


Transfer of Training 


Training is provided in which the sub- 
ject is required to 


onsense syl- 
atterns. Fol- 
S required to 


ich may be of 
one or more of the following types 


(symbolized in S-R terms, for con- 
venience) : 


Type SRSR: The subject is asked 
to respond in the second task as he 
did in the first, to the same stimuli, 
Type SRsR: The subject is asked 
to respond in the second task as he 


did in the first, but to stimuli dif- 
ferent from those of the first task. 
Type SRSr: The subject is asked to 
respond differently in the second 
task to the same stimuli used in the 
first. 

Type SRsr: The subject is asked to 
respond differently in the second 
task to stimuli different from those 
used in the first. 


In these paradigms, instructions to the 
subject may include directions to re- 
spond as above, or he may be given only 
general instructions such as those pro- 
vided in Paired-associates training where 


he is asked to attempt to anticipate each 
response item in a list. 


The paradigms have b 
employed in studies of 
tion, acquired distinctiveness of cues, 


It is often in- 
ntal results, in 
studies employing these Paradigms, that 
the response employed in the first task 
facilitates Performance of the second if 


d. For example, 
in a study by McAllister (30), it was 


rbal response em- 


een extensively 


paradigms are thus con- 
rms of the stimulus-re- 
ations in the two tasks, sev- 
eral points of close correspondence in 
the points of view mentioned earlier 
become apparent, First of all, in the 
recognition paradigm, the subject is 
asked to respond in the second task 
(usually by implication) in two ways: 
(a) as he did in the first to the same 
stimuli, and (b) differently to different 


sidered, in te 
sponse rel 
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stimuli. If he responds correctly with 
the responses that he learned during 
first-task training, it is predicted by the 
transfer theorist that the response of 
recognition will be facilitated. Con- 
versely, if recognition is found to im- 
prove following practice in labeling, say, 
it is inferred that the response of label- 
ing facilitated recognition. These re- 
quirements, implicitly made in the rec- 
ognition paradigm, are tantamount to 
imposing, simultaneously, tasks of Type 
SRSR and SRsr of the transfer para- 
digm. Thus, experiments on recognition 
which implicitly impose these require- 
ments may be regarded as forms of 
transfer tasks which involve simply dif- 
ferential response. Conversely, transfer 
experiments which impose similar tasks 
may support a view that differential re- 
sponses arise from processes of recogni- 
tion. 

To emphasize the relations between 
these paradigms, consider also the fol- 
lowing. 

1. Tasks of Type SRSR require re- 
tention of a stimulus, a response, OF a 
specific S-R relation from the first task 
to the second. They also require recog- 
nition of the stimulus in order that the 
correct response can be made. 

2. Tasks of Type SRsR may require 
generalization of a response from first- 
task stimuli to similar stimuli in the 
second. Failure of recognition or dif- 
ferential discrimination would result in 
Positive transfer. 

3. Tasks of Type SRSr may require 2 
different response to the same stimuli in 
the second task. Failure to recognize 
a stimulus as old could result in positive 
transfer; i.e., the subject would respon: 
differently, as required. Correct recog- 
nition would result in negative transfer, 
since the subject would respond with the 
old response, and interference would re- 
sult. 
4. Tasks of Type SRsr require differ- 
ential responses to different stimuli in 
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the two tasks. Recognition should result 
in just such performance. Zero trans- 
fer would be predicted, however, since 
no old stimuli are present to evoke the 
recognition response. 

To summarize, recognition and trans- 
fer may be regarded as directly related, 
inversely related, or not related at all, 
depending upon the relations between 
the stimuli, responses, or both, in the 
two tasks. In general (but only in gen- 
eral), they may be regarded as related 
by way of the common requirement of 
differential response to different stimuli 
and its relation to activities which re- 
flect recognition. With these points in 
view, let us examine some current prin- 
ciples of transfer as they relate to the 


study of recognition. 


APPLICABILITY OF PRINCIPLES OF 
TRANSFER TO THE STUDY 
oF RECOGNITION 


A number of investigators have at- 
tempted to account for transfer of train- 
ing in verbal and motor tasks by ap- 
plication of hypotheses derived from 
studies of perceptual phenomena, pri- 
marily those involving discrimination. 
At the same time, hypotheses arising out 
of association theory (31, 32, 37), stud- 
ies of concept formation (8), and S-R 
formulations (21) have been invoked to 
account for data otherwise presumed to 
reflect perceptual processes. Although 
they have seldom been advanced in pure 
form, we may distinguish three diver- 
gent points of view on these questions. 

First, classical theory regards transfer 
to be a function of S-R relations, and 
to be dependent upon similarity rela- 
tions between stimuli or responses in 
the two tasks. This view is clearly seen 
in the transfer and retroaction model of 
Osgood (36), to which we have already 
made reference. A second view is seen 
in the studies of stimulus predifferentia- 


tion. Transfer is now regarded as a 
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function of changes in discriminability 
of stimuli, either as a result of reduction 
of generalization gradients (18) or asa 
result of increased distinctiveness arising 
from the addition of response-produced 
cues (21, 33). This view is presented 
in the well known hypothesis of Miller 
and Dollard (33), and in the extended 
formulation of Goss (21). A third view 
regards transfer as resulting from cer- 
tain mediational Processes, including 
verbal associations to task stimuli. This 
view has received emphasis in the recent 
discussion by Postman, who holds to the 
point that perceptual learning may be 
encompassed by association theory, by 
way of an account which includes verbal 
response processes (37). To what ex- 
tent each of these divergent views is 
relevant to the study of recognition isa 
question of some concern. We shall 
next attempt to examine these formula- 
tions with this question in mind, 


S-R Stumarrry RELATIONS AND 
TRANSFER 


Osgood, in his treatment of transfer 
(36), regards three Principles to be of 
primary importance in determining the 
direction and extent of transfer: (a) 
when responses in the two tasks are 
(functionally) identical, Positive trans. 
fer is obtained and increases with in- 
creasing similarity of the stimuli; (b) 
when the stimuli in the two tasks are 
(functionally) identical, with responses 
varied, negative transfer is obtained and 
decreases with increasing Similarity of 
the responses; (c) when both stimuli 
and responses are varied, negative trans- 
fer is obtained and increases with in- 
creasing stimulus similarity, 

It is apparent from the foregoing dis- 
cussion of the paradigms cited that 
these principles, paraphrased from Qs- 
good’s discussion of the model (36, pp. 
134-135), are partly principles of recog- 
nition. However, some qualifications 
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need to be mentioned. First, recogni- 
tion may be regarded as a case of trans- 
fer in terms of the paradigm for tasks 
of Type SRSR, since both stimuli and 
responses are identical and should lead 
to correct recognition. But failure of 
recognition could also be regarded as 
Positive transfer, since the identical re- 
sponse, made to a new stimulus, would 
lead to an incorrect selection (recogni- 
tion) response. The second principle 
would apply to recognition, if a subject 
first learned a label for, say, a form and 
then was asked to press a switch when 
that form again appeared. This would 


between the label and the switch-press- 
ing response, 

arise over one’s 
between the Tesponses. A number of 
experiments, for example, have indicated 


of these Principles involyes the implicit 
at transfer is related to 
In terms of the model, 


stimuli which are least discriminable are 


recognition. This Point need not be 
of major Concern, but care needs to be 


Thus, we see 
may be used in 
of the findings 
tion. On the o 
recognition tas 
these Principle. 


that S-R formulations 
Part to account for some 
from studies of recogni- 
ther hand, some facts of 
ks are not in accord with 
S—indeed, even some facts 
of transfer are Not, as noted earlier (10) 
—and additional hypotheses may be 
needed. Some of these added hypothe- 
Ses may be found in studies of transfer 


—_— m m 


TRANSFER OF TRAINING 381 


as related to discrimination, discussed 
in the next paragraphs. 


STIMULUS DISCRIMINABILITY AND 
TRANSFER 

Some studies have indicated that 
transfer may result from improvement 
in discriminability of stimuli following 
training. The hypotheses of these 
studies involve two different kinds of as- 
sumed processes. The first kind, repre- 
sented in the papers of Gibson (17, 19) 
and Gibson and Gibson (20), assumes 
that differential reinforcement during 
learning serves to reduce the gradient of 
stimulus generalization. The second 
kind, which appears in the formulation 
of Miller and Dollard (33), assumes 
that the stimuli become more “distinc- 
tive” as a result of the addition of re- 
sponse-produced cues generated during 
learning. Both of these hypotheses in- 
volve the assumption that the operations 
of a learning experiment provide for 
development of increased stimulus dis- 
criminability. Both also postulate that 
discriminability thus developed persists, 
and is effective in a transfer task which 
involves differential response. Goss 
(21) has extended these postulates to 
include response relations in the two 
tasks. He assumes that responses made 
during the first task give rise to stimuli 
which, added to the task stimuli, make 
them more distinctive. He also assumes 
that these response-produced stimuli 
may be conditioned to the response of 
the transfer task, in addition to or inde- 
pendent of other conditioning. Accord- 
ing to this view, if an instrumental re- 
sponse has been conditioned to a stimu- 
lus in the past, and if that stimulus is 
aroused by a response made in a learn- 
ing task, transfer to a new task will 
occur by way of the relation between 
the response-produced stimulus and the 
instrumental response of the transfer 
task. Thus, in the experiment by Gagne 


and Baker, cited earlier (15), the trans- 
fer found could be accounted for on the 
basis that the labels learned in the first 
task facilitated establishment of distinc- 
tiveness by becoming part of the stimu- 
lus-complex employed in the second task. 
The positive transfer found by McAl- 
lister (30) could result from the pre- 
established relation between the label 
learned in the first task and the motor 
response of the second. 

These principles may also be em- 
ployed to account in part for the data of 
recognition studies. Added distinctive- 
ness would reduce generalization and 
thus reduce the possibility of false rec- 
ognitions in the second task. Pre-exist- 
ing relations between responses made to 
the stimuli would facilitate appropriate 
instrumental responses which reflect rec- 
ognition. In the experiment by Arnoult, 
for example, the addition of the label 
during paired-associates training could 
be regarded as facilitating discrimina- 
tion of the old stimuli from the new, and 
this improved discrimination would re- 
duce the likelihood of selection of a new 
form. In McAllister’s study, increased 
relatedness of the response label to the 
response of switch selection, and the cor- 
responding increase in transfer, could 
be regarded as reflecting correct recog- 
nition. 

Here, as in the previous discussion, 
some qualification is needed. In terms 
of the paradigms cited earlier, recogni- 
tion is conceived as depending upon dif- 
ferential response to discriminative stim- 
uli. Pressing a correct switch when an 
old stimulus is presented is facilitated, 
presumably, by recognition resulting in 
part from the response of labeling. 
Pressing an incorrect switch may reflect 
failure of recognition and result in nega- 
tive transfer. Conversely, as in the 
previous examples, recognition could re- 
sult from positive transfer of a labeling 
response learned during the first task, 
and failure of recognition could result 
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similarly. The interpretation of, a 
recognition experiment requires careful 
analysis of the S-R relations involved 
if they are to be used to account for the 
results. 


VERBAL MEDIATION AND TRANSFER 


Several studies have suggested that 
transfer may be mediated by verbal as- 
sociations, established either between 
the stimuli of the first task and the re- 


sponses of the second or between the’ 


responses of the first task and the re- 
sponses of the second. 


Noble (34) and Noble and McNeely 
(35) have shown that verbal stimuli 
(words, syllables, or paralogs) are 
learned more rapidly if they also have 
greater numbers of associations made 
to them by independent groups of sub- 
jects. Postman (37) has also empha- 
sized the view that associations made to 
stimuli lend meaning to them and facili- 
tate transfer and improve discrimina- 
bility. This latter view is taken by 
Postman as an alternative hypothesis, 
concerning the development of discrimi- 
nation processes, to that of Gibson and 
Gibson (20). As such, the association- 
istic hypothesis represents an additional 
one which may apply to recognition as 
well. Facilitation of transfer would re- 
sult, in this view, from enrichment of 
the stimuli, by way of the responses 
made to them during the first task. 
Recognition would also, Presumably, be 
improved by similar enrichment. Stated 
another way, the facilitation could re- 
sult from an increase in the probability 
of occurrence of a mediating association 
between the stimuli of the first task and 
the responses of the second. In Ar- 
noult’s experiment, for example, the im- 
proved recognition of the forms for 
which the subjects invented their own 
labels may have come about as a result 
of the use of labels having increased 
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meaningfulness (2) as well as belonging- 
ness, as postulated (2). 

In general, according to this view, re- 
sponses to old stimuli would be facili- 
tated by virtue of associations formed 
during learning, and responses to new 
stimuli would occur only as a result of 
generalization from old stimuli, on the 
basis of similarity, as in Osgood’s model 
(36). Similarly, if associations were 
formed between the responses of the 
learning and transfer tasks, transfer and 
recognition would be facilitated in a 


manner such as that postulated by Goss 
(21): 


IMPLICATIONS FOR THE STUDY or 
RECOGNITION 


The formulations of transfer that we 
have outlined above may be seen to in- 
volve four kinds of assumptions which 


are generalizable to the study of recog- 
nition: 


1. Transfer depends on the degree to 
which responses learned to old stimuli 
generalize to new stimuli. This assump- 
tion is evident in the empirical model of 
Osgood (36). Recognition, similarly 
conceived, would depend upon the dis- 
criminability of the stimuli, but would 
be inversely related to transfer. Selec- 
tion responses, made to new stimuli, 
would reflect positive transfer, but also 
failure of discrimination and false recog- 
nition. As a working hypothesis, this 
Proposition appears fairly obvious, but 
it is not easy to test in all cases. The 
question of Specification of relevant 
stimulus dimensions against which to 
relate discriminability is particularly dif- 
ficult to answer, and only a bare be- 
ginning has been made in the develop- 
ment of physical variables which de- 
scribe complex stimuli. In the case of 
forms and Patterns, for example, physi- 
cal variables have not been clearly de- 
scribed in Many studies, and invariant 
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properties of “form” which distinguish 
them from other “forms” have not been 
adequately investigated (4, 5, 28) in- 
dependently of learning (5) or judg- 
ments (23). Alteration of structure 
(24), judgments of meaning (23), and 
similar variables have been used as 
measures of “similarity,” but without a 
defined physical scale. The efforts of 
Attneave and Arnoult (5) to develop 
measures of forms are important ad- 
vances in this direction, as are other 
similar attempts (28). But greater ef- 
forts are clearly required before the re- 
lation between discriminability (25) and 
recognition (and transfer, for that mat- 
ter) may be clearly established. 

2. Transfer depends on the extent to 
which the stimuli of the training task be- 
come differentiated or distinctive, either 
as a result of reduction of generalization 
(17) or as a result of addition of re- 
sponse-produced cues (21, 33). To the 
extent that recognition is dependent 
upon discriminability, by these hypothe- 
ses it too would be facilitated and re- 
lated to transfer, as in Paragraph 1 
above. The added hypothesis here, of 
course, is that discriminability may be 
improved by differential response train- 
ing, and that it persists and facilitates 
recognition. 

3. As a corollary to the first hypothe- 
sis above, we would add that transfer 
and recognition are dependent upon the 
degree to which response-produced stim- 
uli are themselves discriminable or dis- 
Clearly, in Goss’s formulation, 
-produced stimuli are not 
ill not add dis- 
Here, 


tinctive. 
if the response 
discriminable, they W 
tinctiveness to the task stimuli. 
as in the first hypothesis, the specifica- 
tion of physical measures of similarity 
of such stimuli needs to be made before 
adequate tests of this hypothesis may be 
conducted. More often than not, re- 
Sponse-produced cues have been in- 
ferred, and more direct establishment of 
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their existence and their measurement 
would constitute an important advance. 

4. Transfer and recognition are de- 
pendent upon associative connections, 
formed during learning, and their rela- 
tions both to the first-task stimuli and 
to the second-task responses. The prob- 
ability of arousal of such associations 
and their relations to the second-task 
responses would need to be manipu- 
lated in systematic fashion in order to 
test this hypothesis. Although evidence 
exists to support the hypothesis in part, 
variation of both aspects (probability 
of occurrence and relation to the second- 
task response) would need to be carried 
out in the same experiment to obtain a 
clear view of the relative contribution 
of each. Birge, for example, found that 
a naming response, learned for nonsense 
pictures, facilitated transfer of a reach- 
ing response from one picture to another 
for which the same name was learned 
(9). Buzzotta (11) was able to alter 
the probability that an association would 
occur to a response word used in a first- 
task practice period, and this probability 
was related to transfer in a second task 
involving different responses. Vander- 
plas and Garvin‘ found that recogni- 
tion was related to the degree to which 
the stimuli of a paired-associates task 
(nonsense forms) elicited associations in 
an independent group used to scale the 
forms for association value. Independ- 
ent variation of these variables in the 
same group of subjects would extend 
knowledge about their importance to a 


considerable degree. 

The above hypotheses are not re- 
garded as mutually exclusive, in the 
sense that if one is true others are not. 
Rather, they may be conceived as repre- 
senting stages in the development of 
transfer or recognition as a result of 
first-task practice. To the extent that 
one begins with discriminable stimuli, 
differential responses may be made to 
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different stimuli, thus (perhaps) increas- 
ing their discriminability. Associations 
made to the stimuli or responses may 
add to the likelihood of transfer, dis- 
crimination, and recognition. Differ- 
ential response-produced cues, them- 
selves discriminable, may further add to 
these processes, by way of their likeli- 
hood of occurrence or their likelihood 
of evoking instrumental responses upon 
which recognition depends. The rela- 
tions among the hypotheses are further 
questions to be answered empirically. 
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THE PERCEPTRON: A PRO 


BABILISTIC MODEL FOR 


INFORMATION STORAGE AND ORGANIZATION 
IN THE BRAIN? 


F. ROSENBLATT 


Cornell Aeronautical Laboratory 

If we are eventually to understand 
the capability of higher organisms for 
perceptual recognition, generalization, 
recall, and thinking, we must first 


have answers to three fundamental 
questions: 


1. How is information about the 
physical world sensed, or detected, by 
the biological system ? 

2. In what form is information 
stored, or remembered? 

3. How does information contained 
in storage, or in memory, influence 
recognition and behavior? 


The first of these questions is in the 
province of sensory physiology, and is 
the only one for which appreciable 
understanding has been achieved. 
This article will be concerned pri- 
marily with the second and third 
questions, which are still subject to a 
vast amount of speculation, and where 
the few relevant facts currently sup- 
plied by neurophysiology have not yet 
been integrated into an acceptable 
theory. 

With regard to the se 
two alternative Positions have been 
maintained. The first suggests that 
storage of sensory information is in 
the form of coded representations or 
images, with some sort of one-to-one 
mapping between the Sensory stimulus 


1 The development of this theory has been 
carried out at the Cornell Aeronautical Lab- 
oratory, Inc., under the Sponsorship of the 
Office of Naval Research, Contract Nonr- 
2381(00). This article is primarily an adap- 
tation of material reported in Ref. 15, which 
constitutes the first full report on the Program, 


cond question, 
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and the stored pattern. According to 
this hypothesis, if one understood the 
code or “wiring diagram” of the nerv- 
ous system, one should, in principle, 
be able to discover exactly what an 
organism remembers by reconstruct- 
ing the original sensory patterns from 
the “memory traces” which they have 
left, much as we might develop a 
photographic negative, or translate 
the pattern of electrical charges in the 
“memory” of a digital computer. 
This hypothesis is appealing in its 
simplicity and ready intelligibility, 
and a large family of theoretical brain 
models has been developed around the 
idea ofa coded, representational mem- 
ory (2,3,9,14). The alternative ap- 
Proach, which stems from the tradi- 
tion of British empiricism, hazards the 
guess that the images of stimuli may 
never really be recorded at all, and 
that the central nervous system 
simply acts as an intricate switching 
network, where retention takes the 
form of new Connections, or pathways; 
between centers of activity. In many 
of the more recent developments of 
this position (Hebb’s “cell assembly,” 
and Hull's “cortical anticipatory goal 
response,” for example) the “re- 
sponses” which are associated to 
stimuli may be entirely contained 
within the CNS itself. In this case 
the response represents an ‘‘idea 

rather than an action. The impor- 
tant feature of this approach is that 
there is never any simple mapping of 
the stimulus into memory, according 
to some code which would permit ae 
later reconstruction. Whatever in- 


e, 
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formation is retained must somehow 
be stored as a preference for a par- 
ticular response; i.e., the information 
is contained in connections or associa- 
tions rather than topographic repre- 
sentations. (The term response, for 
the remainder of this presentation, 
should be understood to mean any 
distinguishable state of the organism, 
which may or may not involve ex- 
ternally detectable muscular activity. 
The activation of some nucleus of cells 
in the central nervous system, for 
example, can constitute a response, 
according to this definition.) 
Corresponding to these two posi- 
tions on the method of information 
retention, there exist two hypotheses 
with regard to the third question, the 
manner in which stored information 
exerts its influence on current activity. 
The “coded memory theorists’ are 
forced to conclude that recognition of 
any stimulus involves the matching 
or systematic comparison of the con- 
tents of storage with incoming sen- 
sory patterns, in order to determine 
whether the current stimulus has been 
seen before, and to determine the ap- 
propriate response from the organism. 
The theorists in the empiricist tradi- 
tion, on the other hand, have essen- 
tially combined the answer to the 
third question with their answer to the 
second: since the stored information 
takes the form of new connections, Or 
transmission channels in the nervous 
system (or the creation of conditions 
which are functionally equivalent to 
new connections), it follows that the 
new stimuli will make use of these new 
pathways which have been created, 
automatically activating the appro- 
priate response without requiring any 
separate process for their recognition 
or identification. 
The theory to be presented here 
takes the empiricist, or “connectionist 
Position with regard to these ques- 


tions. The theory has been developed 
for a hypothetical nervous system, or 
machine, called a perceptron. The 
perceptron is designed to illustrate 
some of the fundamental properties of 
intelligent systems in general, without 
becoming too deeply enmeshed in the 
special, and frequently unknown, con- 
ditions which hold for particular bio- 
logical organisms. The analogy be- 
tween the perceptron and biological 
systems should be readily apparent to 
the reader. 

During the last few decades, the 
development of symbolic logic, digital 
computers, and switching theory has 
impressed many theorists with the 
functional similarity between a neuron 
and the simple on-off units of which 
computers are constructed, and has 
provided the analytical methods nec- 
essary for representing highly complex 
logical functions in terms of such 
elements. The result has been a 
profusion of brain models which 
amount simply to logical contrivances 
for performing particular algorithms 
(representing “recall,” stimulus com- 
parison, transformation, and various 
kinds of analysis) in response to 
sequences of stimuli—e.g., Rashevsky 
(14), McCulloch (10), McCulloch & 
Pitts (11), Culbertson (2), Kleene 
(8), and Minsky (13). A relatively 
small number of theorists, like Ashby 
(1) and von Neumann (17, 18), have 
been concerned with the problems of 
how an imperfect neural network, 
containing many random connections, 
can be made to perform reliably those 
functions which might be represented 
by idealized wiring diagrams. Un- 
fortunately, the language of symbolic 
logic and Boolean algebra is less well 
suited for such investigations. The 
need for a suitable language for the 
mathematical analysis of events in 
systems where only the gross organ- 
ization can be characterized, and the 
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precise structure is unknown, has led 
the author to formulate the current 
model in terms of probability theory 
rather than symbolic logic. 

The theorists referred to above were 
chiefly concerned with the question of 
how such functions as perception and 
recall might be achieved by a deter- 
ministic physical system of any sort, 
rather than how this is actually done 
by the brain. The models which have 
been produced all fail in some im- 
portant respects (absence of equi- 

potentiality, lack of neuroeconomy, 
excessive specificity of connections 
and synchronization requirements, 
unrealistic specificity of stimuli suffi- 
cient for cell firing, postulation of 
variables or functional features with 
no known neurological correlates, etc.) 
to correspond to a biological system. 
The proponents of this line of ap- 
Proach have maintained that, once it 
has been ‘shown how a physical 
system of any variety might be made 
to perceive and recognize stimuli, or 
perform other brainlike functions, it 
would require only a refinement or 
modification of existing principles to 
understand the working of a more 
realistic nervous System, and to elim- 
inate the shortcomings mentioned 
above. The writer takes the Position, 
on the other hand, that these short- 
comings are such that a mere refine- 
ment or improvement of the Principles 
already suggested can never account 
for biological intelligence; a difference 
in principle is clearly indicated. The 
theory of statistical separability (Cf, 
15), which is to be summarized here, 
appears to offer a solution in principle 
to all of these difficulties, 

Those theorists—Hebb (7), Milner 
(12), Eccles (4), Hayek (6)—who 
have been more directly concerned 
with the biological nervous system 
and its activity in a natural environ- 

ment, rather than with formally anal- 
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ogous machines, have generally been 
less exact in their formulations and far 
from rigorous in their analysis, so that 
it is frequently hard to assess whether 
or not the systems that they describe 
could actually work in a realistic nerv- 
ous system, and what the necessary 
and sufficient conditions might be. 
Here again, the lack of an analytic 
language comparable in proficiency to 
the Boolean algebra of the network 
analysts has been one of the main 
obstacles. The contributions of this 
group should perhaps be considered as 
suggestions of what to look for and 
investigate, rather than as finished 
theoretical systems in their own right. 
Seen from this viewpoint, the most 
suggestive work, from the standpoint 
of the following theory, is that of 
Hebb and Hayek. 

The position, elaborated by Hebb 
(7), Hayek (6), Uttley (16), and 
Ashby (1), in particular, upon which 
the theory of the perceptron is based, 


can be summarized by the following 
assumptions: 


1. The physical connections of the 
7 eewous system which are involved in 
learning and recognition are not iden- 
tical from one Organism to another. 
At birth, the Construction of the most 
Important Networks is largely random, 
subject to a minimum number of 
Senetic constraints, 

2. The original System of connected 
cells is capable of a certain amount of 
plasticity; after a period of neural 
activity, the probability that a stim- 
ulus applied to one set of cells will 
cause a response jn some other set is 


likely to change, due to some rela- 
tively long-lasting changes in the 


neurons themselves, 
3. Through exposure to a large 
sample of stimuli, those which are 


” (in some sense which 
must be defined in terms of the 
Particular physical system) will tend 
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to form pathways to the same sets of 
responding cells. Those which are 
markedly ‘dissimilar’? will tend to 
develop connections to different sets of 
responding cells. 

4. The application of positive and/ 
or negative reinforcement (or stimuli 
which serve this function) may facil- 
itate or hinder whatever formation of 
connections is currently in progress. 

5. Similarity, in such a system, is 
represented at some level of the nerv- 
ous system by a tendency of similar 
stimuli to activate the same sets of 
cells. Similarity is not a necessary 
attribute of particular formal or geo- 
metrical classes of stimuli, but de- 
pends on the physical organization of 
the perceiving system, an organiza- 
tion which evolves through interaction 
with a given environment. The 
structure of the system, as well as the 
ecology of the stimulus-environment, 
will affect, and will largely determine, 
the classes of ‘‘things’’ into which the 

` perceptual world is divided. 


THE ORGANIZATION OF A PERCEPTRON 


The organization of a typical photo- 
perceptron (a perceptron responding 
to optical patterns as stimuli) is shown 
in Fig. 1. The rules of its organiza- 


tion are as follows: 


1. Stimuli impinge on a retina of 
sensory units (S-points), which are 
assumed to respond on an all-or- 
nothing basis, in some models, or with 
a pulse amplitude or frequency pro- 
portional to the stimulus intensity, 1n 
other models. In the models con- 
sidered here, an all-or-nothing re- 
sponse will be assumed. 

2. Impulses are transmitted to a set 
of association cells (A-units) in a 


‘ . . . = 
‘projection area” (Ar). This pro 
jection area may be omitted in some 

connected 


models, where the retina is c 


directly to the association area (An). 
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Fic. 1. 


Organization of a perceptron. 


The cells in the projection area each 
receive a number of connections from 
the sensory points. The set of S- 
points transmitting impulses to a par- 
ticular A-unit will be called the origin 
points of that A-unit. These origin 
points may be either excitatory or in- 
hibitory in their effect on the A-unit. 
If the algebraic sum of excitatory and 
inhibitory impulse intensities is equal 
to or greater than the threshold (8) of 
the A-unit, then the A-unit fires, again 
on an all-or-nothing basis (or, in some 
models, which will not be considered 
here, with a frequency which depends 
on the net value of the impulses 
received). The origin points of the 
A-units in the projection area tend to 
be clustered or focalized, about some 
central point, corresponding to each 
A-unit. The number of origin points 
falls off exponentially as the retinal 
distance from the central point for 
the A-unit in question increases. 
(Such a distribution seems to be sup- 
ported by physiological evidence, and 
serves an important functional pur- 
pose in contour detection.) 

3. Between the projection area and 
the association area (Ar), connections 
are assumed to be random. That is, 
each A-unit in the Arr set receives 
some number of fibers from origin 
points in the Ar set, but these origin 
scattered at random 


points are _ at 
throughout the projection area. 
Apart from their connection distri- 
bution, the Am units are identical 


with the Ar units, and respond under 


similar conditions. 
4. The “responses,” Ra, Ra c.o 
R, are cells (or sets of cells) which 
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respond in much the same fashion as 
the A-units. Each response has a 
typically large number of origin points 
located at random in the An set. The 
set of A-units transmitting impulses 
to a particular response will be called 
the source-set for that response. 
(The source-set of a response is iden- 
tical to its set of origin points in the 
A-system.) The arrows in Fig. 1 
indicate the direction of transmission 
through the network. Note that up 
to Arr all connections are forward, and 
there is no feedback. When we come 
to the last set of connections, between 
Ar and the R-units, connections are 
established in both directions. The 
rule governing feedback connections, 
in most models of the perceptron, 
be either of the following alternati 


(a) Each response has excitatory 
feedback connections to the cells in its 
own source-set, or 

(b) Each response has inhibitory 
feedback connections to the comple- 
ment of its own Source-set (i.e., it tends 

i vity in any association 
cells which do not transmit to it), 

The first of these rules seems more 
plausible anatomically, since the R- 
units might be located in the same 
cortical area as their respective source- 
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Fic. 2A. Schematic Tepresentation of 
connections in a simple perceptron. 
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Fic. 2B. Venn diagram of the same per- 
ceptron¥ (shading shows active sets for Ri 
response). 


sets, making mutual excitation be- 
tween the R-units and the A-units of 
the appropriate source-set highly 
probable. The alternative rule (b) 
leads to a more readily analyzed sys- 
tem, however, and will therefore be 
assumed for most of the systems to be 
evaluated here. 

Figure 2 shows the organization of 
a simplified perceptron, which affords 
a convenient entry into the theory of 
statistical separability. After the 
theory has been developed for this 
simplified model, we will be in a better 
Position to discuss the advantages of 
the system in Fig. 1. The feedback 
connections shown in Fig. 2 are in- 
hibitory, and go to the complement 
of the source-set for the response from 
which they originate; consequently, 
this system is organized according to 
Rule b, above. The system shown 
here has only three stages, the first 
association stage having been elim- 
inated. Each A-unit has a set of 
randomly located origin points in the 
retina. Sucha system will form simi- 


larity concepts on the basis of coin- 
cident areas of s 
the similarity of contours or outlines. 
While su 


tage in 


e responses in 


Peres £ a system organized 
in this fashion are 


mutually exclusive. 
If Ri occurs, it will tend to inhibit R2 
and will also inhibit the source-set for 
Ra. Likewise, i R: should occur, it 
will tend to inhibit Ri. If the total 
impulse received from all the A-units 
fa One SOurce-set is stronger or more 
frequent than the impulse receive 
by the alternative (antagonistic) re- 
sponse, then the first response wil 


i as ca O C 
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tend to gain an advantage over the 
other, and will be the one which 
occurs. If such a system is to be 
capable of learning, then it must be 
possible to modify the A-units or their 
connections in such a way that stimuli 
of one class will tend to evoke a 
stronger impulse in the Rı source-set 
than in the Rə source-set, while 
stimuli of another (dissimilar) class 
will tend to evoke a stronger impulse 
in the Re source-set than in the Ri 
source-set. 

It will be assumed that the impulses 
delivered by each A-unit can be 
characterized by a value, V, which 
may be an amplitude, frequency, 
latency, or probability of completing 
transmission. If an A-unit has a high 
value, then all of its output impulses 
are considered to be more effective, 


more potent, or more likely to arrive 
at their endbulbs than impulses from 
an A-unit with a lower value. The 
value of an A-unit is considered to be 
a fairly stable characteristic, probably 
depending on the metabolic condition 
of the cell and the cell membrane, but 
it is not absolutely constant. It is 
assumed that, in general, periods of 
activity tend to increase a cell’s value, 
while the value may decay (in some 
models) with inactivity. The most 
interesting models are those in which 
cells are assumed to compete for met- 
abolic materials, the more active cells 
gaining at the expense of the less 
active cells. In such a system, if 
there is no activity, all cells will tend 
to remain in a relatively constant 
condition, and (regardless of activity) 
the net value of the system, taken in 


TABLE 1 
Comparison OF LOGICAL CHARACTERISTICS OF a, ĝ, AND Y SysTEMS 
a-System 8-System y-System 
(Uncompensated (Constant Feed (Parasitic Gain 
Gain System) System) System) 
Total value-gain of source set per rein- f; z 
forcement Nor K 0 
AV for A-units active for 1 unit of time +1 K/Nor +1 
AV for inactive A-units outside of domi- s 
nant set 0 K/Nar y 
0 =Ne 
AV for inactive A-units of dominant set 0 Var Nar 
i i Constant 
ğ Increases with number Increases with 
Mean value of A-system nf reinforcements time | 
i iffer- 0 0 
Difference between mean values of | Proportional to qig 
ource sals ences of reinforce- 
ment frequency 
(tsr — ar) 


Note: In the £ and y systems, th 
for all source-sets of which it is a member. 
Na, = Number 

Na, = Total number 

na, = Number o. 


e total value-change for any 


A-unit will be the sum of the AV’s 


of active units in source-set 
of units in source-set 
f stimuli associated to response 7; 


= Arbitrary constant 
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its entirety, will remain constant at 
all times. Three types of systems, 
which differ in their value dynamics, 
have been investigated quantitatively. 
Their principal logical features are 
compared in Table 1. _ In the alpha 
system, an active cell simply gains an 
increment of value for every impulse, 
and holds this gain indefinitely. In 
the beta system, each source-set is 
allowed a certain constant rate of gain, 
the increments being apportioned 
among the cells of the source-set in 
Proportion to their activity. In the 
gamma system, active cells gain in 
value at the expense of the inactive 
cells of their source-set, so that the 
total value of a source-set is alw 
constant, 

For purposes of 
venient to distin 
the response of t 
ulus (Fig. 3), 
phase, 


ays 


analysis, it is con- 
guish two phases in 
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Fic, 3A. Predominant phase. Inhibitory connections 
are not shown. Solid black units are active, 
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Fic. 3B. Postdominant phase, 
suppresses rival sets. 
only for Ru. 


Dominant Subset 
Inhibitory connections shown 


Fic. 3. Phases of response to a stimulus, 


F. ROSENBLATT 


plement of its own source-set, and 
thus preventing the occurrence of any 
alternative response. The response 
which happens to become dominant is 
initially random, but if the A-units are 
reinforced (i.e., if the active units are 
allowed to gain in value), then when 
the same stimulus is presented again 
at a later time, the same response will 
have a stronger tendency to recur, and 


learning can be said to have taken 
place. 


ANALYSIS OF THE PREDOMINANT 


PHASE 


The perceptrons considered here 
will always assume a fixed threshold, 
6, for the activation of the A-units. 
Such a system will be called a fixed- 
threshold model, in contrast to a con- 
tinuous transducer model, where the 
response of the A-unit is some con- 
tinuous function of the 
stimulus energy. 


In order to predict the learning 
Curves of a fixed-threshold perceptron, 
two variables have been found to be 


of primary importance. They are 
defined as follows: 


P, 


impinging 


a = the expected proportion of A- 


units activated by a stimulus of a 
given size, 

P. = the conditional probability 
that an A-unit which 


responds to 4 
will also respond 
mulus, S». 


It can be shown (Rosenblatt, 15) that 
as the size of the retina is increased, 
the number of S-points (N,) quickly 
ceases to be a Significant parameter, 
and the values of P, and P, approach 
the value that they would have for a 
retina with infinitely many points. 
For a large retina, therefore, the 
equations are as follows: 


given stimulus, Si, 
to another given sti 


E 
P= > P(e,i) 


e=0 i=ĝ 


(1) 


ee e D en p~ 


THE PERCEPTRON 393 


where 
P(e) = (*) R(t — R= 
x CG} Rit — Rr 


R = proportion of S-points activated 
by the stimulus 

x = number of excitatory connec- 
tions to each A-unit 

y = number of inhibitory connec- 
tions to each A-unit 


@ = threshold of A-units. 


(The quantities e and į are the ex- 
citatory and inhibitory components of 
the excitation received by the A-unit 
from the stimulus. If the algebraic 
sum a = e + i is equal to or greater 
than @, the A-unit is assumed to re- 
spond.) 


i] 


i 
t= 


Pa e=0 i=0—0 le 


E E Pleilalagog) 2) 


ge=0 g;=0 
(e — i — le + li + ge — gi Z 9) 
where 


P(eri,leligegi) 


e 
x (7) Ra = Re 
e 


xl ) za — L) 


( 

( 

x (i) LS = D)e 
( 
( 


i 


x = °) Gu(1 — G) e 


ge 


g= ý Ga(l — G) 


x 
gi 


and 


L = proportion of the S-points illumi- 
nated by the first stimulus, Sı, 
which are not illuminated by 


2 


G = proportion of the residual S-set 
(left over from the first stim- 
ulus) which is included in the 
second stimulus (Sə). 


The quantities R, L, and G specify the 
two stimuli and their retinal overlap. 
l, and l; are, respectively, the numbers 
of excitatory and inhibitory origin 
points “lost” by the A-unit when 
stimulus S; is replaced by S2; ge and 
g: are the numbers of excitatory and 
inhibitory origin points “gained” 
when stimulus Sı is replaced by Se. 
The summations in Equation 2 are 
between the limits indicated, subject 
to the side condition e — i — le + li 
gs — gi 2 8. 

Some of the most important char- 
acteristics of P, are illustrated in Fig. 
4, which shows Pe as a function of the 
retinal area illuminated (R). Note 
that Pa can be reduced in magnitude 
by either increasing the threshold, 0, 
or by increasing the proportion of in- 
hibitory connections (y). A compari- 
son of Fig. 4b and 4c shows that if the 
excitation is about equal to the inhibi- 
tion, the curves for Pa as a function 
of R are flattened out, so that there is 
little variation in Pa for stimuli of 
different sizes. This fact is of great 
importance for systems which require 
P, to be close to an optimum value in 
order to perform properly. 

The behavior of P. is illustrated in 
Fig. 5and6. The curves in Fig. 5 can 
be compared with those for Pe in Fig. 
4. Note that as the threshold is in- 
creased, there is an even sharper re- 
duction in the value of P, than was the 
P. also decreases as the 
ry connections 
Fig. 5, which is 


case with Pa. Pe alsi 
proportion of inhibito 
increases, as does Pa. 


394 


a) EFFECT OF INHIBITORY- 
; EXCITATORY MIXTURE. O = | 


Set a DIRT Rae iF aa 
PROPORTION OF S-POINTS ILLUMINATED 
(R) 


FIG. 4. P, 


calculated for nonoverlapping stimuli, 
illustrates the fact that P. remains 
greater than zero even when the stim- 
uli are completely disjunct, and illumi- 
nate no retinal points in common. In 
Fig. 6, the effect of varying amounts 
of overlap between the stimuli is 
shown. In all cases, the value of ips 
goes to unity as the stimuli approach 
perfect identity. For smaller stimuli 
(broken line curves), the value of P, 
is lower than for large stimuli. Simi- 
larly, the value is less for high thresh- 
olds than for low thresholds, The 
minimum value of P, will be equal to 


Pomin = (1 — L)*(1 — Gy, (3) 


In Fig. '6; Boe corresponds to the 
curve for 0 = 10. Note that under 
these conditions the probability that 
the A-unit responds to both stimuli 
(P.) is practically zero, except for 
stimuli which are quite close to 
identity. This condition can be of 
considerable help in discrimination 
learning. 


(b) VARIATION WITH @ 
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(c) VARIATION WITH M AND O 
FOR HIXTURES ABOUT 50% 
INHIBITORY. SOLID LINES 
ARE FORX =5, Y= 5. 
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as function of retinal] area illuminated. 


MATHEMATICAL ANALYSIS 
OF LEARNING IN THE 
PERCEPTRON 


The response of the perceptron in 
the predominant phase, where some 
fraction of the A-units (scattered 
throughout the System) responds to 
the stimulus, quickly gives way to the 
postdominant response, in which ac- 
tivity is limited to a single source-set, 
the other sets being suppressed. ‘Two 
possible systems have been studied for 
the determination of the “dominant” 
response, in the Postdominant phase. 
In one (the mean-discriminating sys- 
tem, or u-system), the response whose 
inputs have the greatest mean value 


responds first, gaining a slight advan- 
tage over the others, so that it quickly 
becomes dom} 


Mant. In the second 
Case (the sum-discriminating system, 
or 2-system), the response whose 1n- 
puts have the Sreatest net value gains 
an advantage, In most cases, sys- 
tems which respond to mean values 
have an advantage Over systems which 
respond to sums, since the means are 
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Fic. 5. Peas a function of R, 
for nonoverlapping stimuli. 


less influenced by random variations’ 


in P, from one source-set to another. 
In the case of the y-system (see Table 
1), however, the performance of 
the u-system and Z-system become 
identical. 

We have indicated that the percep- 
tron is expected to learn, or to form 
associations, as a result of the changes 
in value that occur as a result of the 
activity of the association cells. In 
evaluating this learning, one of two 
types of hypothetical experiments can 
be considered. In the first case; the 
perceptron is exposed to some series 
of stimulus patterns (which might be 
presented in random positions oP the 
retina) and is “forced” to give the 
desired response in each case. (This 
forcing of responses is assumed to be 
a prerogative of the experimenter. In 
experiments intended to evaluate 
trial-and-error learning, with more 
Sophisticated perceptrons, the exper!- 
menter does not force the system to 


respond in the desired fashion, but 
merely applies positive reinforcement 
when the response happens to be cor- 
rect, and negative reinforcement when 
the response is wrong.) In evaluating 
the learning which has taken place 
during this ‘“‘learning series,’’ the 
perceptron is assumed to be “frozen” 
in its current condition, no further 
value changes being allowed, and the 
same series of stimuli is presented 
again in precisely the same fashion, so 
that the stimuli fall on identical posi- 
tions on the retina. The probability 
that the perceptron will show a bias 
towards the “‘correct’”’ response (the 
one which has been previously rein- 
forced during the learning series) in 
preference to any given alternative 
response is called P», the probability 
of correct choice of response between 
two alternatives. 

In the second type of experiment, a 
learning series is presented exactly as 
before, but instead of evaluating the 
perceptron’s performance using the 
same series of stimuli which were 
shown before, a new series is pre- 
sented, in which stimuli may be drawn 
from the same classes that were previ- 
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p rienced, but are not neces- 
oS Sdentical This new test series 
is assumed to be composed of stimuli 
projected onto random retinal poSI- 
tions, which are chosen independently 
of the positions selected for the learn- 
ing series. The stimuli of the test 
series may also differ in size or rota- 
tional position from the stimuli which 
were previously experienced. In this 
case, we are interested in the prob- 
ability that the perceptron will give 
the correct response for the class of 
stimuli which is represented, regard- 
less of whether the particular stimulus 
has been seen before or not. This 
probability is called P,, the prob- 
ability of correct generalization. As 
with P,, P, is actually the probability 
that a bias will be found in favor of the 
Proper response rather than any 
alternative; only one Pair of respo 
at a time is considered, and the 
that the response bias is correct in 
pair does not mean that there 
not be other pairs in which the 
favors the wrong response. 
ability that the correct res 
be preferred over all 
designated Pp or Pe. 
In all cases investigated 
general equation gives a 
proximation to P, and Ps 
propriate constants are su 
This equation is of the form 


one 
nses 
fact 
one 
may 
bias 
The prob- 
ponse will 
alternatives is 


» a single 
close ap- 
if the ap- 
bstituted. 


P = P(Na > 0)-4(Z) (4) 


where 
P(N, > 0) = 1m P,)Ne 
$(Z) = 


normal curve integral 
from — » toZ 
and 
CMs, + Ce 
Z= 


Veatts,? + cans, 


If Rı is the “correct” response, and R, 
is the alternative response under con- 
sideration, Equation 4 is the prob- 
ability that Rı will be preferred over 
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Rə after n., stimuli have been shown 
for each of the two responses, during 
the learning period. N, is the number 
of “effective” A-units in each source- 
set; that is, the number of A-units in 
either source-set which are not con- 
nected in common to both responses. 
Those units which are connected in 
common contribute equally to both 
sides of the value balance, and con- 
sequently do not affect the net bias 
towards one response or the other. 
Na, is the number of active units in a 
source-set, which respond to the test 
stimulus, St-P(Na, > 0) is the prob- 
ability that at least one of the Ne 
effective units in the source-set of the 
correct response (designated, by con- 
vention, as the R, response) will be 
activated by the test stimulus, Si. , 
In the case of P,, the constant cs is 
always equal to zero, the other three 
constants being the same as for Bis 
e values of the four constants 
depend on the parameters of the 
physical nerve net (the perceptron) 


and also on the organization of the 
stimulus environment. 


The simplest case: 
those in which the 
stimuli drawn fro 
ment,” consistin 
points of illumi 
no attempt to 
ing to intrinsic 


S to analyze are 
perceptron is shown 
m an “ideal environ- 


etinal points, and we 
Ri as the 
, 

response for the first 500 
nd R3 for the remaining 500. 
This environment is “ideal” only in 
the sense that We speak of an ideal gas 
in physics; it is a convenient artifact 
for purposes of analysis, and does not 
lead to the best performance from the 
perceptron. In the ideal environ- 
ment situation, the constant cı is 
always equal to zero, so that, in the 
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case of P, (where cz is also zero), the 
value of Z will be zero, and P, can 
never be any better than the random 
expectation of 0.5. The evaluation 
of P, for these conditions, however, 
throws some interesting light on the 
differences between the alpha, beta, 
and gamma systems (Table 1). 

First consider the alpha system, 
which has the simplest dynamics of 
the three. In this system, whenever 


an A-unit is active for one unit of 
time, it gains one unit of value. We 
will assume an experiment, initially, 
in which N., (the number of stimuli 
associated to each response) is con- 
In this case, 


stant for all responses. 
for the sum system, 


i H 
HH ae! 
1 


10 00 
wn, (NUMBER oF STI 


-( 5) as function of wre, 
i a Ideal environm 


(we = 0, Pa = -005. 


where w = the fraction of responses 
connected to each A-unit. If the 
source-sets are disjunct, w = 1/Nr, 
where Np is the number of responses 
in the system. For the y-system, 


oa =0 

co = (1— Pa)N. 

6; = 0 (6) 
C, = 2w 


The reduction of cs to zero gives the 
p-system a definite advantage over the 
D-system. Typical learning curves 
for these systems are compared in 
Fig. 7 and 8. Figure 9 shows the 
effect of variations in Pa upon the 
performance of the system. 

If s, instead of being fixed, is 
treated as a random variable, so that 
the number of stimuli associated to 
each response is drawn separately 
from some distribution, then the per- 


10,000 100, 000 


1000 
KULI ASSOCIATED TO EACH RESPONSE) 


for discrete subsets. 
ent assumed.) 
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Fic. 8. Pw as function of Nns. 


formance of the a-system is consider- 
ably poorer than the above equations 
indicate. Under these conditions, the 
constants for the -system are 

4 =0 

&=1— P, 


g = ratio of ons, to Ths, 


Nr = number of responses in the sys- 
tem 

Na = number of A-units in the sys- 
tem 

w: = proportion of A-units common 
to Ri and R». 


For this equation (and any others jn 
which Ms, is treated as a random 
variable), it is necessary to define Noy 
in Equation 4 as the expected value 
of this variable, over the set of all 
responses. 


(For P, = :07, w = 0, 


SYHPTOTE = , 
tf} 1 


100,000 108 
Ideal environment assumed,) 


For the b-system, there is an even 
greater deficit in performance, due to 
the fact that the net value continues 
to grow regardless of what happens 
to the system. The large net values 
of the subsets activated by a stimulus 
tend to amplify small statistical differ- 
ences, causing an unreliable perform- 
ance. The Constants in this case 
(again for the 4-system) are 


a1=0 
a= (1 — PN; 
Cs = 2(P, qa x2)? 
a= 2(1 = P.)oNeN, 
In both the alpha an 
performance will b 
sum-discriminati 
mean-discriming 
gamma-system 


(8) 


d beta systems, 
e poorer for the 
ng model than for the 
ting case. In the 

» however, it can be 
shown that Pry = Paw; i.e., it makes 
no difference jn performance whether 
the -system or u-system is used. 
Moreover, the constants for the y- 
system, with variable Ms, are identical 
to the constants for the alpha p-sys- 
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(For na, = 1,000, we = 0. Ideal environment assumed.) 


tem, with äs, fixed (Equation 6). demonstrates the advantage of the 


the three systems y-system. 


The performance of 5 
Let us now replace the “ideal en- 


Fic. 9. Pr as function of Pa. 
$ is compared in Fig. 


10, which clearly 
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vironment” assumptions with a model 
for a ‘“‘differentiated environment,” jn 
which several distinguishable classes 
of stimuli are present (such as squares, 
circles, and triangles, or the letters of 
the alphabet). If we then design an 
experiment in which the stimuli asso- 
ciated to each response are drawn from 
a different class, then the learning 
curves of the perceptron are drasti- 
cally altered. The most important 
difference is that the constant cı (the 
coefficient of m, in the numerator of Z) 
is no longer equal to zero, so that 
Equation 4 now has a nonrandom 
asymptote. Moreover, in the form 
for P, (the Probability of correct 
generalization), where c& = 0, the 
quantity Z remains greater than zero, 
and P, actually approaches the same 
asymptote as P,. Thus the equation 
for the perceptron’s performance after 
infinite experience with each class of 
stimuli is identical for P, and Py: 


Pra = Poo = M- (1 — P,)%e] 


xo( 2) (9) 


This means that in the limit it makes 


no difference whether the perceptron has 
seen a particular test stimulus before or 
not; if the stimuli are drawn from a 
differentiated environment, the perform- 
ance will be equally good in either case, 
In order to evaluate the perform- 
ance of the system in a differentiated 
environment, it is necessary to define 
the quantity Peas. This quantity is 
interpreted as the expected value of 
P, between pairs of stimuli drawn at 
random from classes œ and B. In 
particular, Pen is the expected value 
of P, between members of the same 
class, and Pex. is the expected value of 
P, between an S; stimulus drawn from 
Class 1 and an Ss stimulus drawn from 
Class 2. Peiz is the expected value of 
P, between members of Class 1 and 


stimuli drawn at random from all 
other classes in the environment. X 
E Pa >: Pe sS Pas the limiting 
performance of the perceptron (Pgs) 
will be better than chance, and learn- 
ing of some response, Rj, as the proper 
“generalization response” for mem- 
bers of Class 1 should eventually 
occur. If the above inequality is not 
met, then improvement over chance 
performance may not occur, and the 
Class 2 response is likely to occur 
instead. It can be shown (15) that 
for most simple geometrical forms, 
which we ordinarily regard as “simi- 
lar,” the required inequality can be 
met, if the parameters of the system 
are properly chosen. 
he equation for P,, for the sum- 
discriminating version of an a-percep- 
tron, in a differentiated environment 
where 7,, is fixed for all responses, will 


have the following expressions for the 
four coefficients: 


GSPN PnP) j 
c&2=PN.(1— Pa) 
a= > P.(1—P,)N, 

7=1,2 


FoF (Pers)) +2 (oN p— 1) 

(PaPas) JEPEN. ve 

XLo (Pass) + (Np —1)2 

Xo (Pas) +2 (Np 1) ¢] 
a= > PN LPa P, 

r=1,2 

=o (Po ORP) 

+(wNp— 1) P= Pi, 


=e? (Pee) ] 


where 


oè (Pe) and o4(P..) represent the 
Vatlaiceof Pasand Peis meas- 


ured Over the set of possible 
test stimuli, S,, and 
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c? (Pa) and o?(Paz) represent the 
variance of Par and Pers meas- 
ured over the set of all A-units, 
aj. 

e= covariance of Pa,Pes, which is 
assumed to be negligible. 


The variances which appear in these 
expressions have not yielded, thus far, 
to a precise analysis, and can be 
treated as empirical variables to be 
determined for the classes of stimuli 
in question. If the sigma is set equal 
to half the expected value of the vari- 
able, in each case, a conservative 
estimate can be obtained. When the 
stimuli of a given class are all of the 
same shape, and uniformly distributed 
over the retina, the subscript s vari- 
ances are equal to zero. Pax) will be 
represented by the same set of coeffi- 
cients, except for c» which is equal to 
zero, as usual. 

For the mean-discriminating sys- 
tem, the coefficients are: 


id} 
i BROKEN CURVES: Ap „i. 
F SOLID CURVES: Ay 


P, and P,as function of nsr 


Fie, 11. 


. Parameters based on square- 


c= (Peu—Pess) 


c= (1— Pan) 
al 1 
ok Paseo 
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Koh Padte Ea | as 
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a= DL pay PaPa 
=p Enr amg 


=ef (Pex) E o’ (Per) 
+ @Nr—- 1) (Paz—Paz’ 
—o2(Pez)—97(Paz))J J 
Some covariance terms, which are 


considered negligible, have been omit- 
ted here. 

‘A set of typical learning curves for 
the differentiated environment model 
is shown in Fig. 11, for the mean- 
discriminating system. The param- 
eters are based on measurements fora 


Rg" Siz * 
Rn "03 Rg)" 


Hl 
RANDOM LEVEL Í 


BANE 


jj 
H H H 
1000 10,000 


rf 
iH 
7 i 


circle discrimination. 


402 F. ROSENBLATT 


square-circle discrimination problem. 
Note that the curves for P, and P, 
both approach the same asymptotes, 
as predicted. The values of these 
asymptotes can be obtained by sub- 
stituting the proper coefficients in 
Equation 9. As the number of asso- 
ciation cells in the system increases, 
the asymptotic learning limit rapidly 
approaches unity, so that for a system 
of several thousand cells, the errors in 
performance should be negligible on a 
problem as simple as the one illus- 
trated here. 

As the number of responses in the 
system increases, the performance be- 
comes progressively poorer, if every 

mutually exclusive 
One method of 
ation (described 
tt, 15) is through 
of responses. In 
f representing 100 


ach of which can be 
cognized as being 
t, and consequently 


ed by a single pair of 
mutually exclusive responses. Given 


an ideal set of binary characteristics 
(such as dark, light; tall, short; 
straight, curved; etc.), 100 stimulus 
classes could be distinguished by the 
proper configuration of only seven 
response pairs. Ina further modifica- 
tion of the system, a single response is 
capable of denoting by its activity or 
inactivity the presence or absence of 
each binary characteristic. The effi- 
ciency of such coding depends on the 
number of independently recognizable 
“earmarks” that can be found to 
differentiate stimuli. If the stimulus 
can be identified only in its entire 
and is not amenable to such analysis, 
then ultimately a separate binary 
response pair, or bit, is required to 


present or absen: 
can be represent 


denote the presence or absence of each 
stimulus class (e.g., “dog” or “not 
dog”), and nothing has been gained 
over a system where all responses are 
mutually exclusive. 


BIVALENT SYSTEMS 


In all of the systems analyzed up to 
this point, the increments of value 
gained by an active A-unit, as a result 
of reinforcement or experience, have 
always been Positive, in the sense that 


an active unit has always gained in 


its power to activate the responses 
to which it is connected. In the 
gamma-system, it is true that some 
units lose value, but these are always 
the inactive units, the active ones 
gaining in Proportion to their rate of 
activity. In a bivalent system, two 
types of reinforcement are possible 
(positive and negative), and an active 
unit may either gain or lose in value, 
depending on the momentary state of 
affairs in the system. If the positive 
and negative reinforcement can be 
controlled by the application of ex- 
ternal stimuli, they become essentially 


equivalent to “reward” and “punish- 
ment,” and can be u 


aracteristic that is 
sitive feedback to its 
“Set if the response is “on,” 
ative feedback (in the sense 
ve A-units will lose rather 
than gain in value) if the response is 
“off,” then the System is still bivalent 


in its characteristics, Such a bivalent 


>. 
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system is particularly efficient in re- 
ducing some of the bias effects (prefer- 
ence for the wrong response due to 
greater size or frequency of its asso- 
ciated stimuli) which plague the alter- 
native systems. 

Several forms of bivalent systems 
have been considered (15, Chap. VII). 
The most efficient of these has the 
following logical characteristics. 

If the system is under a state of 
positive reinforcement, then a positive 
AV is added to the values of all active 
A-units in the source-sets of “on” 
responses, while a negative AV is 
added to the active units in the source- 
sets of “off” responses. If the system 
is currently under negative reinforce- 
ment, then a negative AV is added to 
all active units in the source-set of an 
“on” response, and a positive AV is 
added to active units in an “off” 
source-set. If the source-sets are 
disjunct (which is essential for this 
system to work properly), the equa- 
tion for a bivalent y-system has the 
same coefficients as the monovalent 
a-system, for the u-case (Equation 11). 

The performance curves for this 
system are shown in Fig. 12, where the 
asymptotic generalization probability 
attainable by the system is plotted for 
the same stimulus parameters that 
were used in Fig. 11. This is the 
probability that all bits in an n-bit 
response pattern will be correct. 
Clearly, if a majority of correct Te- 
sponses is sufficient to identify a stim- 
ulus correctly, the performance will be 
better than these curves indicate. 

In a form of bivalent system which 
utilizes more plausible biological as- 
sumptions, A-units may be either 
excitatory oF inhibitory in their effect 
on connected responses. A positive 
AV in this system corresponds to the 
incrementing of an excitatory unit, 
while a negative AV corresponds to 
the incrementing of an inhibitory unit. 


Such a system performs similarly to 
the one considered above, but can be 
shown to be less efficient. 

Bivalent systems similar to those 
illustrated in Fig. 12 have been 
simulated in detail in a series of ex- 
periments with the IBM 704 computer 
at the Cornell Aeronautical Lab- 
oratory. The results have borne out 
the theory in all of its main predic- 
tions, and will be reported separately 
at a later time. 


IMPROVED PERCEPTRONS AND 
SPONTANEOUS ORGANIZATION 


The quantitative analysis of per- 
ceptron performance in the preceding 
sections has omitted any consideration 
of time as a stimulus dimension. A 
perceptron which has no capability 
for temporal pattern recognition is 
referred to as a “momentary stimulus 
perceptron.” It can be shown (15) 
that the same principles of statistical 
separability will permit the perceptron 
to distinguish velocities, sound se- 
quences, etc., provided the stimuli 
leave some temporarily persistent 
trace, such as an altered threshold, 


Fic. 12. Pe, for a bivalent binary system 
(same parameters as Fig. 11). 
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which causes the activity in the A- 
system at time t to depend to some 
degree on the activity at time £ — £: 
It has also been assumed that the 
origin points of A-units are completely 
random. It can be shown that by a 
suitable organization of origin points, 
in which the spatial distribution is 
constrained (as in the projection area 
origins shown in Fig. 1), the A-units 
will become particularly sensitive to 
the location of contours, and perform- 
ance will be improved. . 
In a recent development, Which we 
hope to report in de 
future, it has b 
values of the A 


matically rein. 
ard to whether 


‘wrong,” the 
system will tend towards a stable 
terminal Condition in which (for each 
binary response) the r 


or 


omputer, 
A Perceptron, even with 
logical level of A-uni 


units, can be show 
of interesting prop 
selective recall and selective attention, 
These properties generally depend on 
the intersection of the Source sets for 
different responses, and are elsewhere 
discussed in detail (15). By com- 
bining audio and photo inputs, it is 
possible to associate sounds, or audi- 
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tory “names” to visual objects, and to 
get the perceptron to perform such 
selective responses as are designated 
by the command “Name the object 
on the left,” or “Name the color of 
this stimulus.” 

The question may well be raised at 
this point of where the perceptron’s 
capabilities actually stop. We have 
seen that the system described is suffi- 
cient for pattern recognition, associa- 
tive learning, and such cognitive sets 
as are necessary for selective attention 
and selective recall. The system ap- 
pears to be Potentially capable of 
temporal pattern recognition, as well 
as spatial recognition, involving any 
sensory modality or combination of 
modalities. It can be shown that 
with proper reinforcement it will be 
capable of trial-and-error learning, 
and can learn to emit ordered se- 
quences of responses, provided its own 


responses are fed back through sensory 
channels, 


perceptron is 
, Without further Modification 
i r order func- 
ved in human speech, 
: and thinking? Ac- 
it of the perceptron’s 
ms to lie in the area of 
and the abstraction 
n its “symbolic be- 
Ptron shows some 


(5). 


Sponses to definite, concrete stimuli 
can be learned, €ven when the proper 
response calls for the recognition of a 
number of simultaneous qualifying 
conditions (such as naming the color 
if the stimulus is on the left, the shape 
if it is on the Tight), ‘Agiscon.as:the 
or the recognition of a 
H tween stimuli (such = 

Name the object left of the square.’ 
€ pattern that appeared 
rcle.”), however, the 


communication 


tually, the lim 
capabilities see 
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problem generally becomes excessively 
difficult for the perceptron. Statis- 
tical separability alone does not 
provide a sufficient basis for higher 
order abstraction. Some system, 
more advanced in principle than the 
perceptron, seems to be required at 


this point. 


CONCLUSIONS AND EVALUATION 


The main conclusions of the theo- 
retical study of the perceptron can be 
summarized as follows: 


1. In an environment of random 
stimuli, a system consisting of ran- 
domly connected units, subject to 
the parametric constraints discussed 
above, can learn to associate specific 
responses to specific stimuli. Even if 
many stimuli are associated to each 
response, they can still be recognized 
with a better-than-chance probability, 
although they may resemble one an- 
other closely and may activate many 
of the same sensory inputs to the 
system. 

2. In such an “ideal environment,” 
the probability of a correct response 
diminishes towards its original ran- 
dom level as the number of stimuli 
learned increases. 

3. In such an environment, no 
for generalization exists. . 

4. In a “differentiated environ- 
ment,” where each response is asso- 
ciated to a distinct class of mutually 
correlated, or “similar” stimuli, the 
probability that a learned association 
of some specific stimulus will be cor- 
rectly retained typically approaches a 
better-than-chance asymptote aS the 
number of stimuli learned by the 
system increases. This asymptote 
can be made arbitrarily close to untty 
by increasing the number of associa- 
tion cells in the system. - 

5. In the differentiated environ- 
bility that a stimulus 


basis 


ment, the proba 


which has not been scen before will be 
correctly recognized and associated to 
its appropriate class (the probability 
of correct generalization) approaches 
the same asymptote as the probability 
of a correct response to a previously 
reinforced stimulus. This asymptote 
will be better than chance if the in- 
equality Pas < Pa < Pen is met, for 
the stimulus classes in question. 

6. The performance of the system 
can be improved by the use of a con- 
tour-sensitive projection area, and by 
the use @f a binary response system, 
in which each response, or Spite, 
corresponds to some independent fea- 
ture or attribute of the stimulus. 

7. Trial-and-error learning is possi- 
ble in bivalent reinforcement systems. 

8. Temporal organizations of both 
stimulus patterns and responses can 
be learned by a system which uses 
only an extension of the original prin- 
ciples of statistical separability, with- 
out introducing any major complica- 
tions in the organization of the 
system. 

9. The memory of the perceptron 
is distributed, in the sense that any 
association may make use of a large 
proportion of the cells in the system, 
and the removal of a portion of the 
association system would not have an 
appreciable effect on the performance 
of any one discrimination or associa- 
tion, but would begin to show up as a 
general deficit in all learned asso- 
ciations. 

10. Simple cognitive sets, selective 
recall, and spontaneous recognition 
of the classes present in a given en- 
vironment are possible. The recogni- 
tion of relationships in space and time, 
however, seems to represent a limit to 
the perceptron’s ability to form cog- 


nitive abstractions. 
Psychologists, and learning theorists 


in particular, may now ask: “What 
has the present theory accomplished, 
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beyond what has already been done in 
the quantitative theories of Hull, 
Bush and Mosteller, etc., or physio- 
logical theories such as Hebb’s?” ; The 
present theory is still too primitive, of 
course, to be considered as a full- 
fledged rival of existing theories of 
human learning. Nonetheless, as a 
first approximation, its chief accom- 
plishment might be stated as follows: 
For a given mode of organization 
(a, B, or y; 3 or #; monovalent or 
bivalent) the fundamental phenomena 
of learning, perceptual discrimination, 
and generalization can be Predicted en- 
tirely from six basic physical param- 
eters, namely ; y 
x: the number of excitatory connec- 
tions per A-unit, 
y+ the number of inhib 
tions per A-unit, 
0: the expected threshold of an A- 
unit, i 
w: the proportion of R-units to 
which an A-unit is connected, 
Na: the number of A-units in the 
Npr: 


itory connec- 


system, and 


the number of R-units in the 
System. 


1S very small. 


which is meas- 
urable in its own right, independently of 
the behavioral an Perceptual Phe- 


uence of its foun- 


i variables, the 
present system goes far beyond exist- 


ing learning and behavior theories in 
three main points: parsimony, verj- 
fiability, and explanatory power and 
generality. Let us consider each of 
these points in turn. 
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1. Parsimony. Essentially all of 
the basic variables and laws used in 
this system are already present in the 
structure of physical and biological 
science, so. that we have found it 
necessary to postulate only one hy- 
Pothetical variable (or construct) 
which we have called V, the “value” 
of an association cell ; this is a variable 
which must conform to certain func- 
tional characteristics which can clearly 
be stated, and which is assumed to 


havea potentially measurable physical 
correlate. 


25 Verifiability. Previous quanti- 
tative learning theories, apparently 
without exception, have had one im- 
portant characteristic in common: 
they have all been based on measure- 
ments of behavior, in specified situa- 
tions, using these measurements (after 
theoretical manipulation) to predict 
behavior in other Situations, Such 
a procedure, in the last analysis, 


Process of curve fitting 


While 
is not necessarily 
ct sense, it shares 
difficulties of circu- 


ation behavior, 
trapolation is difficult to justify in a 


new situation, and it has been shown 


that if the basic con tants and param- 
eters are to be deri 


h ~~ > °C derived anew for any 
Situation in whi 


Such ex- 


entially irrefutable, just 


as any successfy] curve-fitting equa- 
tion is irrefutable. 


Sion, or a change in parameters, they 
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have all proved capable of adapting 
to any specific empirical data. This 
is epitomized in the increasingly com- 
mon attitude that a choice of theo- 
retical model is mostly a matter of 
personal aesthetic preference or pre- 
judice, each scientist being entitled to 
a favorite model of his own. In con- 
sidering this approach, one is reminded 
of a remark attributed ta Kistiakow- 
sky, that “given seven parameters, I 
could fit an elephant.” This is clearly 
not the case with a system in which 
thé independent variables, or param- 
eters, can be measured independently 
of the predicted behavior. In such a 
system, it is not possible to “force” 
a fit to empirical data, if the param- 


eters in current use should lead to 


improper results. In the current 
theory, a failure to fit a curve in a new 
situation would be a clear indication 
that either the theory or the empirical 
measurements are wrong. Conse- 
quently, if such a theory does hold up 
for repeated tests, we can be consider- 
ably more confident of its validity and 
of its generality than in the case of a 
theory which must be hand-tailored 
to meet each situation. 

3. Explanatory power and generality. 
The present theory, being derived 
from basic physical variables, is not 
specific to any one organism or learn- 
ing situation. It can be generalized 
in principle to cover any form of be- 
havior in any system for which the 
physical parameters are known. 
theory of learning, constructed on 
these foundations, should be consider- 
ably more powerful than any whic! 
has previously been proposed. It 
would not only tell us what behavior 
might occur in any known organism, 
but would permit the synthesis of 
behaving systems, to meet special 
requirements. Other learning theo- 
ries tend to become increasingly 


qualitative as they are generalized. 
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Thus a set of equations describing the 
effects of reward on T-maze learning 
in a white rat reduces simply to 5 
statement that rewarded behavior 
tends to occur with increasing prob- 
ability, when we attempt to generalize 
it from any species and any situation. 
The theory which has been presented 
here loses none of its precision through 
generality. 

The theory proposed by Donald 
Hebb (7) attempts to avoid these 
difficulties of behavior-based models 
by showing how psychological func- 
tioning might be derived from neuro- 
physiological theory. In his attempt 
to achieve this, Hebb’s philosophy of 

eems close to our own, and 
sen a source of inspira- 
{ f what has been pro- 
posed here. ebb, however, has 
never actually achieved a model by _ 
which behavior (or any psychological 
data) can be predicted from the physio- 
logical system. His physiology is 
more a suggestion as to the sort ot 
organic substrate which might under- 
lie behavior, and an attempt to show 
the plausibility of a bridge between 
biophysics and psychology. 

The present theory represents the | 
first actual completion of such a 
bridge. Through the use of the 
equations in the preceding sections, 
it is possible to predict learning curves 
from neurological variables, and like- 
wise, to predict neurological variables 
from learning curves. How well this 
bridge stands up to repeated crossings 
remains to be seen. Inthe meantime, 
the theory reported here clearly dem- 
onstrates the feasibility and fruitful- 
ness of a quantitative statistical ap- 
proach to the organization of cognitive 
systems. By the study of systems 
such as the perceptron, it is hoped 
that those fundamental laws of organ- 
ization which are common to all 
information handling systems, ma- 
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chines and men included, may even- 
tually be understood. 
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